GENSCAN 1.0 Date run: 8-Nov-116 Time: 01:17:03

Sequence gi568815591f:130280521_130488008 : 207488 bp : 44.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1584   1668   85  2  1   81   84    49 0.723   3.19
 1.02 Intr +   9040   9206  167  2  2   89    9   217 0.027  13.58
 1.03 Intr +  10937  11021   85  1  1   88   59    59 0.025   2.39
 1.04 Intr +  15072  15141   70  1  1   76   41    53 0.021  -2.36
 1.05 Intr +  18226  18307   82  1  1  122   77    43 0.997   6.24
 1.06 Intr +  18750  18884  135  2  0  109   96    99 0.999  13.56
 1.07 Intr +  20296  20394   99  0  0  101   87    96 0.990  11.11
 1.08 Intr +  23774  23884  111  1  0   36   94    45 0.574   0.48
 1.09 Intr +  23958  24059  102  2  0   60  100   106 0.992   9.37
 1.10 Intr +  26267  26377  111  1  0   53   89    69 0.930   4.08
 1.11 Intr +  27787  27877   91  0  1   85   96    76 0.999   7.67
 1.12 Intr +  30198  30466  269  1  2  104   80   281 0.819  26.15
 1.13 Intr +  31518  31602   85  2  1   79   84    56 0.622   3.69
 1.14 Term +  41969  42156  188  0  2   47   48   271 0.697  16.65
 1.15 PlyA +  43637  43642    6                               1.05

 2.00 Prom +  45764  45803   40                              -3.86
 2.01 Init +  62305  62387   83  0  2   42   98    87 0.895   5.54
 2.02 Intr +  67246  67327   82  1  1   93  100   104 0.880  11.74
 2.03 Intr +  69455  69734  280  1  1   38   55   132 0.416   1.95
 2.04 Intr +  72726  72983  258  1  0   91   42   103 0.097   3.43
 2.05 Intr +  75975  76048   74  1  2   83   40    38 0.122  -2.37
 2.06 Intr +  79069  79167   99  0  0   70   92   131 0.501  12.01
 2.07 Intr +  80623  80724  102  0  0  122   93    75 0.998  11.77
 2.08 Intr +  81918  82019  102  2  0   38   82   110 0.974   5.77
 2.09 Intr +  82364  82474  111  1  0   97   77   139 0.586  14.28
 2.10 Intr +  82899  82989   91  2  1   71   64   136 0.516   9.07
 2.11 Intr +  86852  87051  200  0  2   82   77   190 0.997  16.37
 2.12 Intr +  87386  87470   85  1  1   97   65   114 0.996   9.39
 2.13 Term +  87890  88077  188  0  2   65   43   400 0.968  30.85
 2.14 PlyA +  88188  88193    6                               1.05

 3.03 PlyA -  88388  88383    6                              -5.22
 3.02 Term -  88817  88628  190  1  1   63   41   199 0.856   9.62
 3.01 Init -  91005  90932   74  0  2   42   78    71 0.454   2.04
 3.00 Prom -  92689  92650   40                              -4.86

 4.00 Prom +  96525  96564   40                              -5.46
 4.01 Init +  99214  99385  172  0  1   58   96   144 0.591   9.81
 4.02 Intr +  99764  99830   67  0  1   81   76   -58 0.333  -9.74
 4.03 Intr +  99997 100065   69  1  0  106   37    57 0.336   0.70
 4.04 Intr + 100578 100659   82  0  1   81   71   154 0.747  12.64
 4.05 Intr + 101110 101343  234  0  0  116   78   441 0.887  43.79
 4.06 Intr + 101588 101689  102  1  0   65   87   199 0.796  17.87
 4.07 Intr + 102871 102972  102  0  0  114  116    96 0.999  15.37
 4.08 Intr + 103164 103274  111  2  0  111   38   254 0.999  23.28
 4.09 Intr + 104016 104106   91  2  1   94   50    94 0.999   5.77
 4.10 Intr + 104626 104825  200  2  2  100   70   321 0.997  30.57
 4.11 Intr + 105319 105403   85  0  1   73   78   124 0.984   9.29
 4.12 Term + 107304 107491  188  1  2   93   49   360 0.999  30.25
 4.13 PlyA + 107546 107551    6                               1.05

 5.12 PlyA - 108249 108244    6                               1.05
 5.11 Term - 118519 118371  149  2  2  123   43    69 0.424   3.96
 5.10 Intr - 119986 119519  468  2  0   63   57   127 0.406   0.17
 5.09 Intr - 120301 120187  115  1  1   32   83   182 0.875  12.12
 5.08 Intr - 120520 120432   89  2  2   40   70    12 0.847  -5.71
 5.07 Intr - 122279 122128  152  1  2  113   94   136 0.934  16.51
 5.06 Intr - 124188 124044  145  1  1   38   63   126 0.851   4.44
 5.05 Intr - 130671 130602   70  0  1   73   90    33 0.549   0.75
 5.04 Intr - 131720 131659   62  0  2   58  119    16 0.661   0.15
 5.03 Intr - 136446 136399   48  1  0  101  106    40 0.261   5.75
 5.02 Intr - 147498 147435   64  0  1   78  105    24 0.212   1.39
 5.01 Init - 173078 172974  105  2  0   54   99   120 0.778   9.92
 5.00 Prom - 176993 176954   40                              -3.86

 6.00 Prom + 178524 178563   40                              -6.76
 6.01 Init + 180571 180661   91  0  1   54   95    99 0.933   7.86
 6.02 Intr + 194195 194260   66  0  0   52   71    71 0.074   0.68
 6.03 Intr + 197392 197497  106  2  1  137   38    48 0.173   3.97
 6.04 Term + 201326 201452  127  2  1   49   54    78 0.097  -1.74
 6.05 PlyA + 202014 202019    6                               1.05

 7.02 PlyA - 202568 202563    6                               1.05
 7.01 Term - 205866 205565  302  1  2   76   39   171 0.433   6.38

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +   9040   9227  188  2  2   89   49   236 0.972  17.45
S.002 Init +  12661  12728   68  0  2   91  106     1 0.835   2.76

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_1|559_aa
SEVAQKAAQSLRSLHGTKYKVGPICSVIYQASGGSIDWSYDYGIKYSFAFELRDTGRYGF
LLPARQILPTAEETWLGLKAIMEHIWVSVVLVADFKADIDLSDAEPTLCGGQDKDSEGKK
KAALRIAPRNVLPPFRDQVLRINVRNGDEISKLSQLVNSNNLKLNFWKSPSSFNRPVDVL
VPSVSLQAFKSFLRSQGLEYAVTIEDLQALLDNEDDEMQHNEGQERSSNNFNYGAYHSLE
AKIVEKDYSGRSHNQRFHADNMVNSKIRVPEEIVKINKIYHEMDNIAADFPDLARRVKIG
HSFENRPMYVLKIVSDYQRDPAITSILEKMDIFLLPVANPDGYVYTQTQNRLWRKTRSRN
PGSSCIGADPNRNWNASFAGGASCISVFLDYELMFVLGLSPTGKGASDNPCSEVYHGPHA
NSEVEVKSVVDFIQKHGNFKGFIDLHSYSQLLMYPYGYSVKKAPDAEELDKVARLAAKAL
ASVSGTEYQVGPTCTTVYPASGSSIDWAYDNGIKFAFTFELRDTGTYGFLLPANQIIPTA
EETWLGLKTIMEHVRDNLY

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_1|1680_bp
agtgaagtggcccaaaaggctgcccaatctctgagaagcctgcatggcaccaagtacaaa
gtgggaccaatctgctctgtcatctaccaagccagtggaggaagcattgactggtcctat
gattatggcatcaagtactcatttgcctttgaactgagagacacagggcgctacggcttc
ctcttgccagcccgtcagatcctgcccacagccgaggagacctggcttggcttgaaggca
atcatggagcatatttgggtctcagttgttcttgttgctgacttcaaagctgacatcgat
ctcagtgacgctgagcccacgctttgtggtggacaggacaaggacagtgagggcaagaaa
aaggctgcccttcgcattgcaccccgcaatgtcctccctcctttcagggaccaagttttg
aggattaatgtcagaaatggagacgagatcagcaaattgagtcaactagtgaattcaaac
aacttgaagctcaatttctggaaatctccctcctccttcaatcggcctgtggatgtcctg
gtcccatctgtcagtctgcaggcatttaaatccttcctgagatcccagggcttagagtac
gcagtgacaattgaggacctgcaggcccttttagacaatgaagatgatgaaatgcaacac
aatgaagggcaagaacggagcagtaataacttcaactacggggcttaccattccctggaa
gctaaaattgtagaaaaggactattcaggcaggtctcataatcagcgtttccatgccgat
aatatggtcaattccaagattagggtcccggaagagatagttaagatcaacaagatttac
cacgagatggacaacattgccgcagactttcctgacctggcgaggagggtgaagattgga
cattcgtttgaaaaccggccgatgtatgtactgaagattgtatctgattaccagagggat
ccagctatcacctccatcttggagaaaatggatattttcttgttgcctgtggccaatcct
gatggatatgtgtatactcaaactcaaaaccgattatggaggaagacgcggtcccgaaat
cctggaagctcctgcattggtgctgacccaaatagaaactggaacgctagttttgcaggt
ggagcgtcttgcatttctgtgttcttggactatgagttaatgtttgttcttgggctatcc
ccaacaggaaagggagccagcgacaacccttgctccgaagtgtaccatggaccccacgcc
aattcggaagtggaggtgaaatcagtggtagatttcatccaaaaacatgggaatttcaag
ggcttcatcgacctgcacagctactcgcagctgctgatgtatccatatgggtactcagtc
aaaaaggccccagatgccgaggaactcgacaaggtggcgaggcttgcggccaaagctctg
gcttctgtgtcgggcactgagtaccaagtgggtcccacctgcaccactgtctatccagct
agcgggagcagcatcgactgggcatatgacaacggcatcaaatttgcattcacatttgag
ttgagagataccgggacctatggcttcctcctgccagctaaccagatcatccccactgca
gaggagacgtggctggggctgaagaccatcatggagcatgtgcgggacaacctctactag

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_2|584_aa
MIKLWNIKSKEKLFKETRNKERSPSEEQDQVLRVLAKDEKQLSLLGDLEGLKPQKVDFWR
GPARPSLPVDMRVPFSELKDIKAYLESHGLAYSIMIKDIQVKPCPSWDPAFRLPFWLGPN
MEEMFSGLKVDMWFLGLHQRVCEHAVEGTGCPPPHFTKASLDNVTRNFQIQPDGRLSMFL
FQQHNWSLSPSWSLSLPLASRTSVFCLQPAPPLLDPTAYSVFPPGGAMGISNFPAPGMEQ
TLVHFPGQGRFLFLEVGPAVLLDEERQAMAKSRRLERSTNSFSYSSYHTLEEIYSWIDNF
VMEHSDIVSKIQIGNSFENQSILVLKFSTGGSRHPAIWIDTGIHSREWITHATGIWTANK
IVSDYGKDRVLTDILNAMDIFIELVTNPDGFAFTHSMNRLWRKNKSIRPGIFCIGVDLNR
NWKSGFGGNGSNSNPCSETYHGPSPQSEPEVAAIVNFITAHGNFKALISIHSYSQMLMYP
YGRLLEPVSNQRELYDLAKDAVEALYKVHGIEYIFGSISTTLYVASGITVDWAYDSGIKY
AFSFELRDTGQYGFLLPATQIIPTAQETWMALRTIMEHTLNHPY

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_2|1755_bp
atgataaaactgtggaacatcaaaagcaaagagaagctcttcaaagaaaccaggaacaaa
gagaggtcaccttctgaagaacaggaccaggttcttcgagtcctggccaaagatgagaag
cagctttcacttctcggggatctggagggcctgaaaccccagaaggtggacttctggcgt
ggcccagccaggcccagcctccctgtggatatgagagttcctttctctgaactgaaagac
atcaaagcttatctggagtctcatggacttgcttacagcatcatgataaaggacatccag
gtgaagccctgccccagctgggaccctgccttccgccttcctttctggttggggcccaac
atggaggagatgttctcggggctaaaagtggacatgtggtttctgggtctccatcagcgt
gtttgtgaacatgctgtggaaggaacaggctgcccaccccctcacttcaccaaagcttcc
ctcgacaatgtcacacgcaacttccagatccaacccgatggccgactctcaatgttcctc
ttccaacagcacaactggtcactctctccttcctggagcctgtctcttcccctggcatcc
aggacttctgtgttctgtctccagccagcacctcctctcctggatccaaccgcctactca
gtgtttccacctgggggtgcaatgggcatctccaactttccagccccaggaatggagcaa
acgctggtgcattttccaggccaaggcagattcctgttcctggaagtggggccagctgtg
ctgctggatgaggaaagacaggccatggcgaaatcccgccggctggagcgcagcaccaac
agcttcagttactcatcataccacaccctggaggagatatatagctggattgacaacttt
gtaatggagcattccgatattgtctcaaaaattcagattggcaacagctttgaaaaccag
tccattcttgtcctgaagttcagcactggaggttctcggcacccagccatctggattgac
actggaattcactcccgggagtggatcacccatgccaccggcatctggactgccaataag
attgtcagtgattatggcaaagaccgtgtcctgacagacatactgaatgccatggacatc
ttcatagagctcgtcacaaaccctgatgggtttgcttttacccacagcatgaaccgctta
tggcggaagaacaagtccatcagacctggaatcttctgcatcggcgtggatctcaacagg
aactggaagtcgggttttggaggaaatggttctaacagcaacccctgctcagaaacttat
cacgggccctcccctcagtcggagccggaggtggctgccatagtgaacttcatcacagcc
catggcaacttcaaggctctgatctccatccacagctactctcagatgcttatgtaccct
tacggccgattgctggagcccgtttcaaatcagagggagttgtacgatcttgccaaggat
gcggtggaggccttgtataaggtccatgggatcgagtacatttttggcagcatcagcacc
accctctatgtggccagtgggatcaccgtcgactgggcctatgacagtggcatcaagtac
gccttcagctttgagctccgggacactgggcagtatggcttcctgctgccggccacacag
atcatccccacggcccaggagacgtggatggcgcttcggaccatcatggagcacaccctg
aatcacccctactag

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_3|87_aa
MAVQRSGQAGCENKFLIPDPAEPWPGSAYLTQLRDDNQDGVADDDDEKLFFLDMRCFDVL
ALTSSDPVFPRPREPLRTPRGEQVISQ

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_3|264_bp
atggctgtacagagatctggccaagctggctgtgagaacaagttcctcatccctgatcct
gcagagccttggccgggaagtgcctacttaacacagctcagggatgacaaccaggatggt
gtcgctgatgatgatgatgagaagctcttcttcctggatatgcggtgctttgatgtgctg
gctctgactagctctgaccctgtgttcccaagacccagggagcctttgaggacacctcgg
ggggagcaggtcataagccaatga

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_4|500_aa
MLARVGLLTLLLLPGLDSHRTLRPAGGGWEVPQRDCPAQGRGSLDKAGVVTGQTDAKDGR
GWSSGLSLQLPSSLPLYHGGSMRGLLVLSVLLGAVFGKEDFVGHQVLRISVADEAQVQKV
KELEDLEHLQLDFWRGPAHPGSPIDVRVPFPSIQAVKIFLESHGISYETMIEDVQSLLDE
EQEQMFAFRSRARSTDTFNYATYHTLEEIYDFLDLLVAENPHLVSKIQIGNTYEGRPIYV
LKFSTGGSKRPAIWIDTGIHSREWVTQASGVWFAKKITQDYGQDAAFTAILDTLDIFLEI
VTNPDGFAFTHSTNRMWRKTRSHTAGSLCIGVDPNRNWDAGFGLSGASSNPCSETYHGKF
ANSEVEVKSIVDFVKDHGNIKAFISIHSYSQLLMYPYGYKTEPVPDQDELDQLSKAAVTA
LASLYGTKFNYGSIIKAIYQASGSTIDWTYSQGIKYSFTFELRDTGRYGFLLPASQIIPT
AKETWLALLTIMEHTLNHPY

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_4|1503_bp
atgctggcaagggttgggctcctgaccctgctgctgctgcccgggctggactcccacagg
accctaaggccagcaggagggggctgggaagtcccacagagggactgccccgcacagggc
cggggcagcctggacaaagccggtgtggtgacaggccaaaccgacgcaaaagatggtcgg
gggtggagctctggcttatctctccagctgcccagttccctgccactttatcatggaggc
agcatgcgggggttgctggtgttgagtgtcctgttgggggctgtctttggcaaggaggac
tttgtggggcatcaggtgctccgaatctctgtagccgatgaggcccaggtacagaaggtg
aaggagctggaggacctggagcacctgcagctggacttctggcgggggcctgcccaccct
ggctcccccatcgacgtccgagtgcccttccccagcatccaggcggtcaagatctttctg
gagtcccacggcatcagctatgagaccatgatcgaggacgtgcagtcgctgctggacgag
gagcaggagcagatgttcgccttccggtcccgggcgcgctccaccgacacttttaactac
gccacctaccacaccctggaggagatctatgacttcctggacctgctggtggcggagaac
ccgcaccttgtcagcaagatccagattggcaacacctatgaagggcgtcccatttacgtg
ctgaagttcagcacggggggcagtaagcgtccagccatctggatcgacacgggcatccat
tcccgggagtgggtcacccaggccagtggggtctggtttgcaaagaagatcactcaagac
tacgggcaggatgcagctttcaccgccattctcgacaccttggacatcttcctggagatc
gtcaccaaccctgatggctttgccttcacgcacagcacgaatcgcatgtggcgcaagact
cggtcccacacagcaggctccctctgtattggcgtggaccccaacaggaactgggacgct
ggctttgggttgtccggagccagcagtaacccctgctcggagacttaccacggcaagttt
gccaattccgaagtggaggtcaagtccattgtagactttgtgaaggaccatgggaacatc
aaggccttcatctccatccacagctactcccagctcctcatgtatccctatggctacaaa
acagaaccagtccctgaccaggatgagctggatcagctttccaaggctgctgtgacagcc
ctggcctctctctacgggaccaagttcaactatggcagcatcatcaaggcaatttatcaa
gccagtggaagcactattgactggacctacagccagggcatcaagtactccttcaccttc
gagctccgggacactgggcgctatggcttcctgctgccagcctcccagatcatccccaca
gccaaggagacgtggctggcgcttctgaccatcatggagcacaccctgaatcacccctac
tga

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_5|488_aa
MSRDQASEEEEKPCGKVLPINTDCGSKALYLYEEEYLMKRIPQNPRYQHIKSRLDTGNSM
TKYTEKLEEIKKNYRYKKDELFKRLKVTTFAQLIIQVASLSDQTLEVTAEEIQRLEDNDS
AASDPDAETTARTNGKGNPGEQSPSPEQFINNAGAGDSSRSTLQSVISGVGELDLDKGPV
KKAEPHTKDKPYPDCPFLLLDVRDRDSYQQCHIVGDHGNISLQKKRIPACVVSMMGNHPV
PAGVTKNAHGKIIILYDDDERLASQAATTMCERGFENLFMLSGGLCHKALMLKTRAKGFL
VEFSESHVTTDIFWYNLLNDEIVKKDFMSLCWPQLDDQRLEDMQASFQVLAWPCHINAAF
SDDFPNLGLKVLAQKFPEGLITGSLPASCQQALPPGSARKRSSPKGPPLPAENKWRFTPE
DLKKIEYYLEEEQGPADHPSRLNQANSSGRESKVPGARSAQNLPGGGPASHSNPRSLSSG
HLQGKPWK

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_5|1467_bp
atgagcagagaccaggcttctgaagaagaggaaaaaccctgcgggaaggtcctgcccatc
aacactgactgtgggtccaaagccctttacctctatgaggaagagtatctgatgaaaagg
ataccacagaacccaagataccagcatatcaaatcaagactggacactggtaacagtatg
actaaatatactgagaagctcgaagagattaagaaaaattatagatacaaaaaagatgag
cttttcaagagactaaaagttacaacttttgcccagctgatcatccaagttgcttccctc
tctgatcaaacactggaagtgacagctgaggagattcaaaggctggaagacaatgattct
gcagcttcagaccctgatgctgaaaccactgccaggaccaatgggaaaggaaatccaggt
gagcagtcgccgagccctgagcagttcataaacaacgcaggagcaggggactccagccgc
tcaactcttcagagtgtcatcagtggtgttggggaactggatctagacaaagggccagtg
aagaaagcagagccccataccaaagacaaaccttatcctgactgccccttcctgctgcta
gatgtgcgtgatagagattcttaccagcagtgccacattgttggagatcatggtaacatc
agtttgcaaaagaaacgaataccagcttgtgtggtttccatgatgggaaaccatccagtc
cctgctggtgtgacgaaaaatgcccatggcaagatcatcattctgtatgacgatgatgaa
aggctggccagtcaggcggccaccaccatgtgcgagcgtggatttgaaaacctcttcatg
ctttccggagggctctgtcacaaagccctaatgctgaagactcgagcaaaaggatttctt
gtggaattctcagaatcccatgtaacaacagatattttctggtataatttgcttaatgat
gaaatagttaagaaagatttcatgtccttgtgttggcctcaacttgatgaccagagatta
gaagacatgcaggcttctttccaggttttggcctggccttgccatattaatgcagctttt
tctgatgattttccaaacctaggtctaaaagtcttagctcagaaattcccggaaggactg
attactggttccctgccagcatcttgccagcaggcccttcctcctgggtctgcccggaaa
cgatccagccccaaagggccacccctaccagctgagaataaatggagatttaccccagaa
gacttaaaaaagatagaatattatctggaagaggagcaagggcctgcagatcatcctagc
cgactgaaccaagctaactcctccggaagagagtccaaggtgcctggtgcccgaagcgct
cagaatctgccaggtggcggccccgccagccactcaaacccccgctccctcagcagtggt
cacctgcaaggcaaaccctggaagtaa

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_6|129_aa
MLCDSLPGVCILAETALEVRGKRLINFKDLELLIGKMEHLQITKSSGDADAAGTTIQCGA
DTTLLLLSSKHGSCDPGLVIRVPVFREGHGPGQYSLVDLGSGTFGFDFPSTFGADSSCFL
HSWCQHCEE

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_6|390_bp
atgctctgtgatagtcttccaggtgtctgcatcttggctgaaactgctttggaggttcgg
ggaaaaagactcatcaacttcaaggatttagagttgctgattgggaagatggagcatttg
caaattaccaaatcctcaggtgatgccgatgctgcaggaaccactattcagtgtggggct
gataccaccctgttgctactcagctccaagcatggttcctgtgacccaggcctggtcatt
agagtcccagtgttcagggaaggacatgggccaggacagtactctctggtagaccttggc
tccgggacattcgggtttgatttccctagtacctttggtgctgacagctcctgcttccta
cacagctggtgccagcactgtgaagagtga

>gi568815591f:130280521_130488008|GENSCAN_predicted_peptide_7|100_aa
XKTLPTGLHVSENRLRMRFLRPEVLGLHPVPREDPATNTRLGARASPGPRTSPSPPSPAC
AAAFSLPGVSATYCKGYEMAGRGFDNSLRHWIKRIKLYSA

>gi568815591f:130280521_130488008|GENSCAN_predicted_CDS_7|303_bp
ngtaagaccttgcctacaggactccatgtttcggagaaccggttgcgcatgcgcttcctg
aggccagaggttctcggccttcaccctgttcccagggaggacccggccacgaatacgcgg
ctcggggctagagcgtcgccaggtccgcgcacaagcccatcgcccccgagcccagcatgt
gctgccgccttctccctgccaggcgtcagcgccacctattgcaaagggtatgaaatggca
ggccgtggcttcgacaatagcctccggcattggataaagagaataaaattgtattcggcc
tga