GENSCAN 1.0 Date run: 3-Nov-116 Time: 13:32:11 Sequence gi568815583r:66448493_66665365 : 216873 bp : 46.52% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 1549 2982 1434 0 0 42 49 501 0.849 37.42 1.02 PlyA + 3027 3032 6 -0.45 2.00 Prom + 3323 3362 40 -2.46 2.01 Init + 3447 3579 133 2 1 78 47 40 0.018 -0.80 2.02 Intr + 14735 14786 52 0 1 60 98 35 0.004 -0.33 2.03 Intr + 28650 28720 71 0 2 108 92 3 0.004 1.53 2.04 Intr + 33263 33387 125 0 2 134 68 230 0.949 25.90 2.05 Intr + 36498 36638 141 2 0 89 44 57 0.709 1.95 2.06 Term + 42010 42123 114 0 0 95 42 112 0.939 5.87 2.07 PlyA + 43012 43017 6 1.05 3.03 PlyA - 45347 45342 6 1.05 3.02 Term - 46060 45944 117 1 0 68 43 186 0.999 10.64 3.01 Init - 49239 49150 90 0 0 97 55 197 0.985 17.79 3.00 Prom - 50419 50380 40 -11.92 4.08 PlyA - 50859 50854 6 1.05 4.07 Term - 51160 50915 246 1 0 76 37 314 0.981 20.89 4.06 Intr - 51684 51564 121 2 1 69 82 83 0.999 6.30 4.05 Intr - 52613 52457 157 0 1 78 108 83 0.999 8.37 4.04 Intr - 53012 52883 130 2 1 34 51 189 0.559 10.07 4.03 Intr - 53420 53296 125 0 2 57 73 181 0.999 13.80 4.02 Intr - 55037 54866 172 2 1 105 80 235 0.999 23.92 4.01 Init - 56298 56296 3 0 0 98 101 0 0.815 2.30 4.00 Prom - 62490 62451 40 -5.46 5.00 Prom + 66913 66952 40 -1.46 5.01 Init + 70409 70477 69 1 0 91 86 -12 0.650 -0.04 5.02 Intr + 72558 72713 156 2 0 87 99 81 0.978 9.31 5.03 Intr + 81194 81259 66 1 0 80 116 21 0.754 3.20 5.04 Intr + 83755 83911 157 0 1 60 97 50 0.747 2.68 5.05 Term + 84965 84990 26 0 2 87 42 23 0.220 -4.01 5.06 PlyA + 85455 85460 6 1.05 6.12 PlyA - 86361 86356 6 1.05 6.11 Term - 91537 91514 24 1 0 91 44 29 0.297 -2.98 6.10 Intr - 98303 98163 141 2 0 54 91 68 0.686 4.25 6.09 Intr - 103369 103170 200 2 2 53 80 115 0.942 6.27 6.08 Intr - 109391 109230 162 0 0 128 85 134 0.999 17.05 6.07 Intr - 109544 109490 55 2 1 117 105 30 0.779 6.15 6.06 Intr - 112609 112514 96 1 0 82 100 87 0.872 9.51 6.05 Intr - 112823 112695 129 2 0 97 102 153 0.995 18.49 6.04 Intr - 115133 115024 110 0 2 69 94 111 0.992 9.80 6.03 Intr - 115506 115419 88 0 1 99 83 51 0.719 5.24 6.02 Intr - 116347 116184 164 2 2 70 109 119 0.994 11.89 6.01 Init - 116873 116756 118 2 1 109 60 173 0.998 14.96 6.00 Prom - 122787 122748 40 -2.06 7.04 PlyA - 124049 124044 6 1.05 7.03 Term - 126211 126092 120 1 0 89 48 72 0.366 1.77 7.02 Intr - 138818 138502 317 0 2 -68 3 423 0.495 13.98 7.01 Init - 140638 140578 61 1 1 78 110 43 0.972 6.91 7.00 Prom - 142246 142207 40 -6.86 8.04 PlyA - 142480 142475 6 1.05 8.03 Term - 143509 143426 84 1 0 49 49 116 0.267 1.45 8.02 Intr - 145529 145445 85 1 1 111 76 23 0.165 3.22 8.01 Init - 155995 155895 101 1 2 54 108 49 0.189 3.24 8.00 Prom - 160789 160750 40 -7.06 9.00 Prom + 161225 161264 40 2.14 9.01 Init + 162335 162342 8 1 2 103 91 0 0.469 2.30 9.02 Intr + 170928 171041 114 0 0 59 85 53 0.348 1.66 9.03 Intr + 173469 173555 87 0 0 88 41 63 0.258 0.69 9.04 Intr + 173876 173969 94 2 1 70 49 60 0.330 0.27 9.05 Intr + 181306 181390 85 0 1 105 23 96 0.400 4.19 9.06 Intr + 187621 187720 100 2 1 93 98 34 0.486 4.07 9.07 Term + 193419 193554 136 0 1 66 46 80 0.132 -0.91 9.08 PlyA + 194712 194717 6 -0.45 10.04 PlyA - 195081 195076 6 1.05 10.03 Term - 195644 195479 166 1 1 77 42 110 0.216 2.69 10.02 Intr - 204134 203992 143 2 2 76 32 149 0.489 7.25 10.01 Init - 216575 216540 36 2 0 61 96 56 0.466 3.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_1|477_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDIPTANIIPNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT RDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPM TFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR DIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLI KLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTTPSKSG >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_1|1434_bp atgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgcta aaaactctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctat gacatacccacagccaatatcataccgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag gacacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccacatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggacttc atgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactcaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca acatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaccccatcaaaaagtgggtga >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_2|211_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQEQKTKHRIFSLIDWKDSRIYVKREDISI EQQCFSLERVGEAGADSQGWWQAVLYVKPSNILVNSRGEIKLCDFGVSGQLIDSMANSFV GTRSYMSPERLQGTHYSVQSDIWSMGLSLVEMAVGRYPIPPPDAKELELMFGCQVHAFIK RSDAEEVDFAGWLCSTIGLNQPSTPTHAAGV >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_2|636_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aaattggaaaccatcattctcagtaaactatcgcaagaacaaaaaaccaaacaccgcata ttctcactcatagactggaaggacagtaggatatacgtgaagagagaagatatctcgatt gaacagcagtgtttttccctggagcgggtgggagaagctggagctgacagccagggatgg tggcaagctgtgttatatgtcaagccctccaacatcctagtcaactcccgtggggagatc aagctctgtgactttggggtcagcgggcagctcatcgactccatggccaactccttcgtg ggcacaaggtcctacatgtcgccagaaagactccaggggactcattactctgtgcagtca gacatctggagcatgggactgtctctggtagagatggcggttgggaggtatcccatccct cctccagatgccaaggagctggagctgatgtttgggtgccaggttcatgcttttatcaag agatctgatgctgaggaagtggattttgcaggttggctctgctccaccatcggccttaac cagcccagcacaccaacccatgctgctggcgtctaa >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_3|68_aa MLSRLQELRKEEETLLRLKAALHDQLNRLKMLVHVDNEASINQTTLELSTKSHVTEEEEE EEEEESDS >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_3|207_bp atgctgagccggcttcaggaactgcgcaaggaggaggagacgctgctgcggttgaaggca gccctgcacgaccagctgaaccgcctcaagatgttggtgcatgtagacaatgaagcatca atcaaccaaacaaccctggagctgagcacaaagagtcatgtgacggaagaggaggaggag gaagaggaagaagaatcagattcctaa >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_4|317_aa MACARPLISVYSEKGESSGKNVTLPAVFKAPIRPDIVNFVHTNLRKNNRQPYAVSELAGH RIEEVPELPLVVEDKVEGYKKTKEAVLLLKKLKAWNDIKKVYASQRMRAGKGKMRNRRRI QRRGPCIIYNEDNGIIKAFRNIPGITLLNVSKLNILKLAPGGHVGRFCIWTESAFRKLDE LYGTWRKAASLKSNYNKKIHRRVLKKNPLKNLRIMLKLNPYAKTMRRNTILRQARNHKLR VDKAAAAAAALQAKSDEKAAVAGKKPVVGKKGKKAAVGVKKQKKPLVGKKAAATKKPAPE KKPAEKKPTTEEKKPAA >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_4|954_bp atggcgtgtgctcgcccactgatatcggtgtactccgaaaagggggagtcatctggcaaa aatgtcactttgcctgctgtattcaaggctcctattcgaccagatattgtgaactttgtt cacaccaacttgcgcaaaaacaacagacagccctatgctgtcagtgaattagcaggtcat cgtattgaggaagttcctgaacttcctttggtagttgaagataaagttgaaggctacaag aagaccaaggaagctgttttgctccttaagaaacttaaagcctggaatgatatcaaaaag gtctatgcctctcagcgaatgagagctggcaaaggcaaaatgagaaaccgtcgccgtatc cagcgcaggggcccgtgcatcatctataatgaggataatggtatcatcaaggccttcaga aacatccctggaattactctgcttaatgtaagcaagctgaacattttgaagcttgctcct ggtgggcatgtgggacgtttctgcatttggactgaaagtgctttccggaagttagatgaa ttgtacggcacttggcgtaaagccgcttccctcaagagtaactacaacaagaagatccat cgcagagtcctaaagaagaacccactgaaaaacttgagaatcatgttgaagctaaaccca tatgcaaagaccatgcgccggaacaccattcttcgccaggccaggaatcacaagctccgg gtggataaggcagctgctgcagcagcggcactacaagccaaatcagatgagaaggcggcg gttgcaggcaagaagcctgtggtaggtaagaaaggaaagaaggctgctgttggtgttaag aagcagaagaagcctctggtgggaaaaaaggcagcagctaccaagaaaccagcccctgaa aagaagcctgcagagaagaaacctactacagaggagaagaagcctgctgcataa >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_5|157_aa MAHNPNMTHLKINLPVTALPPLWVTSKGFAQYELFKSSALDDTITASQTAIALDISWSPV DEILQIPPLSSTATLLCKVRQVPLLFLCPNILICKVKLHSGSNSLLSKLIHQSYHGTMDT VSLSGTIPVQMLLEIGLDKLKKDYISFFIAHSETSES >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_5|474_bp atggctcacaatcctaatatgacccatttgaagattaatctgccagttactgcccttcct cccctttgggtaacatccaaaggctttgcccagtatgagctctttaagtcctctgccttg gatgatacaatcacagcatcacaaactgcgatcgctttggatatttcctggagtcctgtg gatgagattcttcaaatccctccactctcttcaactgcaactctgctgtgtaaagttcgg caggttcctttacttttcttgtgccccaacatcctcatctgtaaagtaaagctccatagt ggaagtaacagtttactaagtaagctcattcatcagtcttatcatggaaccatggacaca gtttctctcagtgggactattccagttcaaatgcttttggaaattggtttggacaaacta aagaaagattatatcagttttttcatagcccattcagaaacttcagaaagttga >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_6|428_aa MKPVWVATLLWMLLLVPRLGAARKGSPEEASFYYGTFPLGFSWGVGSSAYQTEGAWDQDG KGPSIWDVFTHSGKGKVLGNETADVACDGYYKVQEDIILLRELHVNHYRFSLSWPRLLPT GIRAEQVNKKGIEFYSDLIDALLSSNITPIVTLHHWDLPQLLQVKYGGWQNVSMANYFRD YANLCFEAFGDRVKHWITFSDPRAMAEKGYETGHHAPGLKLRGTGLYKAAHHIIKAHAKA WHSYNTTWRSKQQGLVGISLNCDWGEPVDISNPKDLEAAERYLQFCLGWFANPIYAGDYP QVMKDYIAIKDGANIKGYTSWSLLDKFEWEKGYSDRYGFYYVEFNDRNKPRYPKASVQYY KKIIIANGFPNPREQIDIDNVERTKRKVLITKYLTKRLKCEHKYLIEDFISTLHLKCTWL QPFSEVFM >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_6|1287_bp atgaagccagtgtgggtcgccacccttctgtggatgctactgctggtgcccaggctgggg gccgcccggaaggggtccccagaagaggcctccttctactatggaaccttccctcttggc ttctcctggggcgtgggcagttctgcctaccagacggagggcgcctgggaccaggacggg aaagggcctagcatctgggacgtcttcacacacagtgggaaggggaaagtgcttgggaat gagacggcagatgtagcctgtgacggctactacaaggtccaggaggacatcattctgctg agggaactgcacgtcaaccactaccgattctccctgtcttggccccggctcctgcccaca ggcatccgagccgagcaggtgaacaagaagggaatcgaattctacagtgatcttatcgat gcccttctgagcagcaacatcactcccatcgtgaccttgcaccactgggatctgccacag ctgctccaggtcaaatacggtgggtggcagaatgtgagcatggccaactacttcagagac tacgccaacctgtgctttgaggcctttggggaccgtgtgaagcactggatcacgttcagt gatcctcgggcaatggcagaaaaaggctatgagacgggccaccatgcgccgggcctgaag ctccgcggcaccggcctgtacaaggcagcacaccacatcattaaggcccacgccaaagcc tggcattcttataacaccacgtggcgcagcaagcagcaaggtctggtgggaatttcattg aactgtgactggggggaacctgtggacattagtaaccccaaggacctagaggctgccgag agatacctacagttctgtctgggctggtttgccaaccccatttatgccggtgactacccc caagtcatgaaggactacattgctataaaagatggtgctaatataaaggggtatacttcc tggtctctgttggataagtttgaatgggagaaaggatactcagatagatatggattctac tatgttgaatttaacgacagaaataagcctcgctatccaaaggcttcagttcaatattac aagaagattatcattgccaatgggtttcccaatccaagagagcaaatcgacattgataat gtagaaagaactaaacgtaaggtacttattacaaagtatttgacaaagagactaaaatgt gaacataaataccttatagaggacttcatcagcacacttcacttgaaatgcacctggctg cagccatttagtgaagtcttcatgtga >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_7|165_aa MTGSQQWDVRSNGCYFLVKAEEREEKEEEGEEEKEGEEDKEEEKEEEKEEEKRKKEKEKQ EEEGKKKRKKEEEEKEEEEEEGEEEGEKEEEGEKEEEEGEKEEEEEEGENGEGEKGGRGG GEGGREPPPFTLQKLEAEDLAGIFSSQPWGSPQVEMGGEKCKARS >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_7|498_bp atgactggttctcagcaatgggatgtgagaagtaatgggtgttacttcctggtcaaggca gaagagagagaagagaaggaggaggagggggaggaagagaaggagggggaggaagataag gaggaagagaaggaggaggagaaggaagaggagaagaggaagaaagagaaggagaagcag gaggaggaggggaagaagaagaggaagaaagaggaggaggagaaggaggaggaggaagag gagggggaggaggaaggggagaaggaagaggaaggggagaaggaggaagaggaaggggag aaggaggaggaggaagaggagggggagaatggggagggggagaaggggggaaggggaggg ggagaagggggaagggaacctcctcctttcacccttcagaaactggaagcagaggacttg gctggcatcttcagcagccagccatggggttccccacaggtggaaatggggggagaaaaa tgcaaggcacgatcctga >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_8|89_aa MRIISFDSQNHPTHQVLTSPLYKQGNEGPATLGKRPEFLFPEVCSLLLSVDRHQVLHDPF YQETSKDKFPEYLDGADGEDTDAAITLPT >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_8|270_bp atgaggatcatctcctttgattctcagaaccaccctacacaccaggtactcacatcccca ctttacaagcaaggaaatgaaggcccagcaacattgggcaaacggcctgaattcctcttc cctgaggtctgcagcctcctgctctcagtagacaggcaccaggttctccatgatcctttc taccaggagacatctaaggacaaattccctgaatacttggatggagctgatggcgaagac accgatgctgccatcactttgcccacctga >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_9|207_aa MPSLGSQPGKPSPGGAATCTRFEARTLRFPGVVTLVEQGPWASTPRLRKGQLCPHRAPRG LAERNRVTERAAVATGRSRLIGRPCEPAADPAWPSPGCPSQSWSPVLTAESILNCAAKHR GMGVVELLREAYLRITEEGYCLDSQQSVFSECKIPQTPDSSQQAWDGFSPLLTSGQGHPG VGKPVVEDIQRQCGLEFALFPAQPSRP >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_9|624_bp atgcccagcctcgggtcacagccagggaaacccagccccggaggcgcagcaacctgcacc cgtttcgaggcccgcactttgcgcttcccaggggttgttaccctggtcgagcaaggtccc tgggcgtcaactccccgtctgcggaaagggcagctttgtccccatcgcgcccctcgaggc ctggctgagaggaaccgggtgaccgagagggccgccgtggcgacgggccgctcccgcctt atcggccgcccctgcgagcccgcggccgaccccgcctggccctccccgggctgcccctcc cagagctggtcccctgtgctgacagcagagagtattctcaactgtgcagcaaagcaccgg ggcatgggggtggtggagctgctccgggaggcctatttgagaattactgaggaagggtac tgcttggactcacagcagagtgtgttctccgaatgtaagataccccaaactcctgacagc tctcaacaggcttgggatggcttttctcctctgctgacctctggccagggccaccctggt gtgggcaaaccagtggtggaggacatccagagacaatgtggcctggagtttgccttgttc cctgctcagccctccaggccctaa >gi568815583r:66448493_66665365|GENSCAN_predicted_peptide_10|114_aa MKKKEKKEKIIQSPDKLQILQEEMHMLNSDFLPASLKVNPQDTASKPNHLVGSQGYSVGG DPRGHRPNAEGLSEAPGTHILALNRSWQPGESKAVLMEPDEQPIPHCIRLLGLP >gi568815583r:66448493_66665365|GENSCAN_predicted_CDS_10|345_bp atgaagaagaaggagaagaaggagaaaattatccagagtcctgataaactacagatcctc caagaagagatgcacatgctgaacagtgacttcttgcctgcctccttaaaggtcaacccc caagatacagcatccaaacccaatcacctggtgggcagccagggctatagcgtgggagga gaccccaggggccacagacccaatgctgaaggactctcagaagcaccaggaacacacatc cttgccctcaacagatcctggcaacctggtgaatccaaagccgtccttatggaaccagat gaacaacctataccccactgtatccgtctgcttgggctgccataa