GENSCAN 1.0 Date run: 7-Nov-116 Time: 23:58:30 Sequence gi568815592r:152892857_153102685 : 209829 bp : 37.50% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 5201 5671 471 1 0 59 41 368 0.353 25.07 1.02 PlyA + 7951 7956 6 1.05 2.03 PlyA - 8520 8515 6 1.05 2.02 Term - 19157 18856 302 0 2 82 31 177 0.813 5.80 2.01 Init - 29258 29165 94 2 1 86 85 32 0.513 3.29 2.00 Prom - 31219 31180 40 -3.65 3.00 Prom + 35544 35583 40 -4.55 3.01 Init + 35695 35709 15 0 0 77 87 16 0.673 0.74 3.02 Term + 39595 39735 141 0 0 69 34 169 0.929 6.55 3.03 PlyA + 41322 41327 6 1.05 4.02 PlyA - 42140 42135 6 1.05 4.01 Sngl - 66922 66713 210 1 0 80 46 228 0.681 12.75 4.00 Prom - 73992 73953 40 -3.65 5.07 PlyA - 74749 74744 6 1.05 5.06 Term - 76914 76811 104 1 2 42 42 102 0.315 -1.54 5.05 Intr - 78558 78345 214 0 1 122 47 120 0.661 8.67 5.04 Intr - 79598 79416 183 2 0 53 76 140 0.767 8.36 5.03 Intr - 80280 80190 91 2 1 58 103 25 0.808 0.08 5.02 Intr - 82765 82051 715 2 1 88 108 387 0.966 30.26 5.01 Init - 83587 83470 118 1 1 60 74 103 0.960 6.51 5.00 Prom - 87195 87156 40 -7.55 6.04 PlyA - 89033 89028 6 1.05 6.03 Term - 94376 94278 99 2 0 100 44 107 0.728 4.65 6.02 Intr - 95190 95141 50 1 2 66 62 51 0.429 -2.22 6.01 Init - 96249 96111 139 0 1 62 107 90 0.489 8.66 6.00 Prom - 96807 96768 40 -10.25 7.15 PlyA - 96816 96811 6 1.05 7.14 Term - 97239 97039 201 0 0 77 38 163 0.924 6.61 7.13 Intr - 98465 98329 137 0 2 67 98 65 0.997 4.77 7.12 Intr - 100118 100001 118 2 1 66 89 142 0.999 11.22 7.11 Intr - 101820 101657 164 1 2 92 94 83 0.999 8.07 7.10 Intr - 102438 102280 159 1 0 62 68 98 0.913 4.24 7.09 Intr - 105020 104825 196 2 1 -24 66 192 0.273 3.87 7.08 Intr - 105773 105694 80 0 2 123 80 54 0.302 6.45 7.07 Intr - 109765 109571 195 2 0 21 103 189 0.499 12.16 7.06 Intr - 110451 109857 595 0 1 -1 53 327 0.199 11.51 7.05 Intr - 131640 131406 235 2 1 93 83 147 0.773 11.47 7.04 Intr - 133687 133598 90 0 0 68 101 69 0.646 4.39 7.03 Intr - 138159 138066 94 1 1 92 81 30 0.652 0.80 7.02 Intr - 141661 141485 177 2 0 58 74 56 0.277 0.17 7.01 Init - 145181 145082 100 2 1 72 77 119 0.702 9.67 7.00 Prom - 149586 149547 40 -6.05 8.00 Prom + 157600 157639 40 -4.85 8.01 Init + 163506 163550 45 2 0 91 72 14 0.313 0.83 8.02 Term + 170301 170621 321 2 0 68 43 204 0.933 7.74 8.03 PlyA + 170700 170705 6 1.05 9.00 Prom + 181240 181279 40 -3.45 9.01 Init + 183624 183737 114 2 0 79 105 77 0.963 8.76 9.02 Intr + 195206 195288 83 1 2 71 113 23 0.109 0.62 9.03 Term + 200605 200716 112 1 1 74 41 83 0.090 -0.75 9.04 PlyA + 200891 200896 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 122324 122458 135 1 0 50 24 157 0.867 5.69 S.002 Term + 122587 122703 117 0 0 28 45 162 0.855 3.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_1|156_aa MSMPSSLLALSMRIATCSWSASVCTAMRPQVASMCPTLSSWIWSPAPGTLVCSGSFVQIL RLDNFIFHQSGAGNNWANGHYAEGAELVDSVLDIVRREAESCDCLQDFQLTHSLGAGTGP KRGTPLISKIPEKYPDRILNKFSVPLRKVSDMGVEP >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_1|471_bp atgagcatgccatcgagcctactggcacttagcatgcggatagcaacctgcagctggagt gcatcagtgtgtactgcaatgagaccacaggtggcaagtatgtgccccacgctgagctca tggatctggagcccagcaccggggactctggtgtgctcaggctccttcgtgcagatctta cggctggacaacttcatcttccatcagagtggtgccggaaacaactgggccaatgggcac tacgcagaaggtgcggagctggtggattcggtactggacattgtgaggagggaggctgag agctgtgactgcctgcaggatttccagctgacccactccctaggagcagggactgggcct aagaggggtacccctctcatcagcaagatcccagagaagtatccagacaggatcttgaac aagttcagtgtgcccttacgcaaggtgtcagacatgggggtggagccctaa >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_2|131_aa MVEGERHVLHYGRQEKRAYTGKLPPYKTIRSEIPEALREYCKHLYACKLENLEEMDTFLE TYNCPRLNQKETETLNRPIMISKIELVIKSLSTRKSPGPDRFTAEFYWMYEEELVPILLT LFKKIEKEGLL >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_2|396_bp atggtggaaggtgaaaggcacgtcttacattatggcaggcaagagaagagagcttataca gggaaactccccccttataaaaccatcaggtctgaaataccagaagccctcagagagtat tgtaaacacctctatgcatgcaagctagaaaacctggaagagatggatacattcttggaa acatacaactgcccaagattgaaccagaaagaaactgaaaccctaaacagaccaataatg atctccaaaattgaattagtaataaaaagcctatcaacaagaaaaagccctggaccagac agattcacagctgaattctactggatgtatgaggaagagctggtaccaatcctactgaca ttattcaaaaaaattgagaaggagggactcctctaa >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_3|51_aa MASWKADGEFEEPIAEEGTGDAVCKLTAGCYQEDNKNNTCHTWRRNSSGQQ >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_3|156_bp atggcgtcatggaaagctgatggagaatttgaggagccgatagctgaagaaggaacaggt gatgctgtctgtaaattgactgctggatgttaccaagaagacaataaaaacaatacgtgc catacctggaggcgaaactccagtggacagcagtag >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_4|69_aa MSLGFSDCTTTASPTTMSSGSENHTNTTNPRRKYGPHIGVAEQLWGIPGGKINWRRTELK KKLISAGRC >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_4|210_bp atgtctttgggattcagcgactgcaccacaacagcaagccccacaaccatgtcctcggga agtgaaaaccacaccaatacaacaaaccccaggcggaagtacggaccgcatattggcgtc gccgagcaactgtgggggatcccggggggaaaaatcaactggcgccgaaccgagctgaag aaaaagctgatcagcgcaggccggtgttga >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_5|474_aa MSRYSSLLSSKVDKRFEYIRRSPGAPIRIKLGDKSCGNSGCKEESSTLSVKMKCDFNCNH VHSGLKLVKPDDIGRLVSYTPAYLEGSCKDCIKDYERLSCIGSPIVSPRIVQLETESKRL HNKENQHVQQTLNSTNEIEALETSRLYEDSGYSSFSLQSGLSEHEEGSLLEENFGDSLQS CLLQIQSPDQYPNKNLLPVLHFEKVVCSTLKKNAKRNPKVDREMLKEIIARGNFRLQNII GRKMGLECVDILSELFRRGLRHVLATILAQLSDMDLINVSKVSTTWKKILEDDKGAFQLY SKAIQRVTENNNKFSPHASTREYVMFRTPLASVQKSAAQTSLKKDAQTKLSNQGDQKGST YSRHNEFSEVAKTLKKNESLKACIRCNSPAKYDCYLQRATCKREGCGFDYCTKCLCNYHT TKDCSDGKLLKASCKIGPLPGSMLLDIPVHQLMDVWLFTPFPAVTSKVLSVQSN >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_5|1425_bp atgtcccggtactcgagtttactatccagtaaggtagacaaacgctttgaatatattaga aggtcaccaggagcacccatcagaataaaacttggagataagagctgtgggaattcaggt tgtaaagaagaaagttctaccctttctgtcaaaatgaagtgtgattttaattgtaaccat gttcattccggacttaaactggtaaaacctgatgacattggaagactagtttcctacacc cctgcatatttggaaggttcctgtaaagactgcattaaagactatgaaaggctgtcatgt attgggtcaccgattgtgagccctaggattgtacaacttgaaactgaaagcaagcgcttg cataacaaggaaaatcaacatgtgcaacagacacttaatagtacaaatgaaatagaagca ctagagaccagtagactttatgaagacagtggctattcctcattttctctacaaagtggc ctcagtgaacatgaagaaggtagcctcctggaggagaatttcggtgacagtctacaatcc tgcctgctacaaatacaaagcccagaccaatatcccaacaaaaacttgctgccagttctt cattttgaaaaagtggtttgttcaacattaaaaaagaatgcaaaacgaaatcctaaagta gatcgggagatgctgaaggaaattatagccagaggaaattttagactgcagaatataatt ggcagaaaaatgggcctagaatgtgtagatattctcagcgaactctttcgaaggggactc agacatgtcttagcaactattttagcacaactcagtgacatggacttaatcaatgtgtct aaagtgagcacaacttggaagaagatcctagaagatgataagggggcattccagttgtac agtaaagcaatacaaagagttaccgaaaacaacaataaattttcacctcatgcttcaacc agagaatatgttatgttcagaaccccactggcttctgttcagaaatcagcagcccagact tctctcaaaaaagatgctcaaaccaagttatccaatcaaggtgatcagaaaggttctact tatagtcgacacaatgaattctctgaggttgccaagacattgaaaaagaacgaaagcctc aaagcctgtattcgctgtaattcacctgcaaaatatgattgctatttacaacgggcaacc tgcaaacgagaaggctgtggatttgattattgtacgaagtgtctctgtaattatcatact actaaagactgttcagatggcaagctcctcaaagccagttgtaaaataggtcccctgcct ggcagtatgctgttggatataccagttcaccagttgatggatgtttggctctttactcca tttcctgctgttacaagtaaggtgctgtcagtgcaatcaaattag >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_6|95_aa MVKSGLMERVATAYSVIGKFPVLPWARAGVPLVDQVLQAKNLTLPTAAQNEDTIPEHLNI LHKASFVPISSEDIPYSHGSHPEGLLPPQETSDNV >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_6|288_bp atggtgaagtcaggcctaatggagagagtggctacagcctactcagtgattgggaaattc cctgtcttgccctgggccagagctggtgtaccgctggtggatcaggtcttacaagcaaag aacctcacactcccaactgcagcacaaaatgaagataccatacctgaacacctgaacatt cttcacaaggcctcctttgtgcccatttcttcagaagacatcccctacagccatggttct catccagagggtcttttgcccccgcaggagacatctgacaacgtttag >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_7|846_aa MYRSFGDDLGNRLVLFIPMAMAPSALEKAEHMKGIISVPLQVSGPEEFTRLCRIWLLQAW QGSVCMLSVKVADAVSSVCITSRFPALTLIGTGLHLANNCPFSTQFNLYVFCEAFLNSLM QKHSLTVRNEERGENAGRPTHTTKMESIQVLEECQNPTAEEVLSWSQNFDKMMKAPAGRN LFREFLRTEYSEENLLFWLACEDLKKEQNKKVIEEKARMIYEDYISILSPKEWLSHNTLI KSLLPRTSPTVDKRNISRKKGGGINTRLSIIKLALEDSAAALHRRLTTCAAVTSALQRTL DCPALRLWRPRLSAWERNPHYQKTEGQSAALPFRPNTSPINRIPTKRLSEEAPKVSQARP SPITLRSAPSTFGLREVRSVLTLGRKFPPRLPENPLKSNACAGRGLVRSHPYLQILGVVA RAAVSGTSGSARRPLSSGSPPLEELFTRGGPLRTFLERQAGSEAHLKVRRPELLAVIKLL NEKERELRETEHLLHDENEDLRKLAENEITLCQKEITQLKHQFLRSGNKKKEEPNKDCNM SAYGFPIEALKIACLMRGMSRSTGVGEDSGEAVPGILLKLWLSQNPHKETDENDLILEVT AGVGGQEAMLFTSEIFDMYQQYAAFKRWHFETLEYFPSELGGLRHASASIGGSEAYRHMK FEGGVHRVQRVPKTEKQGRVHTSTMTVAILPQPTEINLVINPKDLRIDTKRASGAGGQHV NTTDSAVRIVHLPTGVVSECQQERSQLKNKELAMTKLRAKLYSMHLEEEINKRQNARKIQ IGSKGRSEKIRTYNFPQNRVTDHRINKTLHDLETFMQGDYLLDELVQSLKEYADYESLVE IISQKV >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_7|2541_bp atgtaccggagctttggggatgaccttggtaatagactggtgctgttcatacccatggcc atggctccctctgccctagagaaggctgaacacatgaaaggtattatttctgtcccttta caagtctcaggcccagaggaatttaccagactgtgtaggatctggctattgcaggcttgg caaggttctgtttgcatgttgtcagttaaggtagcggatgctgtttcttcagtctgtatc acttcgcgcttcccagccctcacattaatcggcacaggtctccatctggcaaacaactgc ccattttcaacccaatttaatctttacgtcttctgtgaagcctttctcaattctcttatg caaaagcacagcctcactgtgaggaatgaagaaagaggggaaaatgcgggaagacccaca cacactacaaaaatggagagtatccaggtcctagaggaatgccaaaaccccactgcagag gaagtcttgtcctggtctcaaaattttgacaagatgatgaaggccccagcaggaagaaac cttttcagagagttcctccgaacagaatacagtgaagagaacctacttttctggcttgct tgtgaagacttaaagaaggagcagaacaaaaaagtaattgaagaaaaggctaggatgata tatgaagattacatttctatactatcaccaaaagagtggttatctcataatacacttatc aagtccttattaccacgaacctcacccaccgtggacaagagaaacataagcaggaaaaaa ggaggaggaataaacacacgcctgtccataataaaactcgctcttgaagactcagcggca gccctgcaccggagactgacgacttgcgcggctgtgacctccgccctgcagcggaccctc gactgccctgcactgcggctctggaggccccgactcagtgcatgggaaagaaatcctcac tatcagaaaacagaggggcaatctgctgctctccctttccggccaaacacgtcacccatc aaccggatacctaccaagaggctttcagaggaggcgcccaaggtctcccaggcccgcccc tccccaatcacgctccgctcagccccctcaacttttggcctccgggaagttcgcagcgtt ctcacgcttggcaggaagttcccgccaaggcttccggaaaatcctttaaaaagcaacgct tgcgctgggcggggcttggtgcgctctcacccttatctccaaattctgggtgttgtcgcg agggctgctgtgtccggaacttccggttccgcccgccggcccctgagctccggtagcccg ccgctggaggagctgttcacccggggcgggcccttgcggaccttcctcgagcgccaggcg gggtctgaagcccatttgaaggtcaggaggcccgagttgctggcggtgatcaaactgctg aacgagaaggagcgggagctgcgggagactgagcacttgctgcacgatgagaatgaagat ttaaggaaacttgcagagaatgaaatcactttgtgtcaaaaagaaataactcagctgaag catcagtttttaagatcaggaaacaaaaaaaaggaggagccaaataaggactgtaacatg agtgcctatggatttcccattgaagctctcaaaattgcctgtttgatgagaggaatgagc aggagcactggtgtaggagaggactctggtgaagctgtccctggtattttgctaaaactc tggctttctcaaaaccctcataaagaaacagatgaaaatgatttgatcctggaagtaact gcaggagttggaggtcaggaggcaatgttgtttacatcagagatatttgatatgtatcag caatatgctgcatttaaaagatggcattttgaaaccctggaatattttccaagtgaacta ggtggccttagacatgcatctgccagcattgggggttcagaagcctataggcacatgaaa tttgaaggaggtgttcacagagtacaaagagtgccaaagacagaaaagcaaggccgcgtc catactagcaccatgactgtagcaatattaccccagcctactgagattaatctggtgatt aatccgaaagatttgagaattgacactaagcgagccagtggagctggggggcagcatgta aataccacggacagtgctgtccggatagttcatcttccaacaggtgttgtttctgaatgt caacaagagagatctcagctgaaaaataaagagctggctatgacaaagttacgtgcaaaa ctgtacagcatgcatctagaagaagaaataaataaaagacagaatgctagaaaaattcag attggaagtaaaggaagatcagagaaaataagaacatataattttccacagaaccgggtc acagatcacagaataaacaagacgctgcatgatcttgaaacttttatgcaaggagattat ctactggatgaacttgtacagtcattgaaggaatacgccgattatgaatctttagtagaa attatttcccaaaaagtttaa >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_8|121_aa MHRIRGYAFTCKIPKTQTNIYNYQDTVDSCRKKRTSPNELNKPPRINLEETEIYDLSDRE FKISVLRKPKEIQENTEKEFRILSDKFNKDNEIIKKNQADILELKNAIGIMKNASESQKQ N >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_8|366_bp atgcacaggataagaggatatgcattcacttgtaagatcccaaagacacagacaaacatc tacaactatcaagacactgttgatagttgtaggaaaaaaaggacctcaccaaatgaacta aataagccaccaaggattaatcttgaagaaacagagatatatgacctttcggacagagaa ttcaaaatatcagttttgaggaaacccaaagaaattcaagagaacacagagaaggaattc agaattctatcagataaatttaacaaagataatgaaataattaaaaagaatcaagcagac attctggagctgaaaaatgcaattggcataatgaagaatgcatctgagtctcaaaagcag aattga >gi568815592r:152892857_153102685|GENSCAN_predicted_peptide_9|102_aa MNIYRVSGSRVLNTIPHKKEPELDGEIVIQGLTQKNPKGTGSKELIQSLLEDQQSGWTRG LLLTSTKMFTLSSTIFHKLLQYQARDSTPDYDHFLPYFKDED >gi568815592r:152892857_153102685|GENSCAN_predicted_CDS_9|309_bp atgaacatatatagagtatctggatcccgggttctaaataccattccccacaaaaaggaa ccagagctggatggagaaatagtgattcaaggtctgacacagaaaaatcctaagggcaca ggtagcaaggaactgatacagtccttgctagaggaccaacaatcagggtggactaggggt ctcctgctaacatccacgaaaatgttcactttatcttccacaatcttccacaagcttctt cagtaccaggcaagggactcaactcctgattatgatcatttcttgccttactttaaggat gaggactaa