GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:47:07 Sequence gi568815590r:28248722_28461124 : 212403 bp : 45.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1181 1294 114 1 0 88 53 58 0.137 2.94 1.02 Intr + 34476 34767 292 2 1 3 66 163 0.122 2.31 1.03 Term + 35852 35958 107 0 2 55 49 133 0.957 4.67 1.04 PlyA + 36435 36440 6 1.05 2.00 Prom + 45365 45404 40 -5.36 2.01 Sngl + 51112 51648 537 0 0 39 35 535 0.854 39.38 2.02 PlyA + 51875 51880 6 1.05 3.00 Prom + 74696 74735 40 -3.76 3.01 Init + 80437 80562 126 0 0 78 92 110 0.651 10.80 3.02 Term + 90319 90723 405 0 0 127 42 444 0.938 38.89 3.03 PlyA + 94629 94634 6 1.05 4.11 PlyA - 96244 96239 6 1.05 4.10 Term - 100109 99998 112 1 1 114 49 233 0.999 20.03 4.09 Intr - 100507 100404 104 1 2 86 94 65 0.576 5.87 4.08 Intr - 101020 100913 108 1 0 81 82 10 0.517 0.18 4.07 Intr - 101435 101343 93 2 0 93 117 -1 0.802 3.46 4.06 Intr - 103086 102774 313 2 1 99 80 204 0.998 16.89 4.05 Intr - 103952 103852 101 2 2 108 63 130 0.999 11.41 4.04 Intr - 104687 104452 236 0 2 70 102 198 0.958 16.81 4.03 Intr - 108058 107949 110 0 2 32 93 129 0.144 7.73 4.02 Intr - 111103 110871 233 1 2 142 68 232 0.227 23.17 4.01 Init - 112403 112164 240 2 0 99 100 191 0.964 17.28 4.00 Prom - 112867 112828 40 -9.16 5.00 Prom + 113734 113773 40 -2.96 5.01 Init + 125193 125279 87 2 0 66 48 46 0.035 -0.96 5.02 Intr + 137266 137754 489 0 0 69 5 312 0.207 14.20 5.03 Intr + 152473 152654 182 0 2 10 30 110 0.019 -3.93 5.04 Term + 153795 154020 226 0 1 51 41 174 0.265 5.25 5.05 PlyA + 154361 154366 6 1.05 6.03 PlyA - 156882 156877 6 1.05 6.02 Term - 163264 162920 345 1 0 79 44 162 0.925 5.39 6.01 Init - 164221 164150 72 1 0 78 60 79 0.855 5.17 6.00 Prom - 170720 170681 40 -4.66 7.04 PlyA - 172611 172606 6 1.05 7.03 Term - 184027 183829 199 0 1 47 48 215 0.097 10.27 7.02 Intr - 203755 203523 233 1 2 63 90 91 0.254 3.37 7.01 Intr - 208209 208045 165 0 0 89 63 113 0.693 9.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_1|170_aa LAAFQTHQPFSSHTHHASCASEQPCRRFSLPAACFHQLIEDQTSHNIPFSQSLIQRKALT LFNSMKYKRGEETAEEWLEASRGWLIRFKGKRHFHDIKVQGEAASADVEAAASSPEDLAK IIDEGGFTKQQIFSVEIDTATPTLSNHHPDESAAINIKARPSTSKKIMIC >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_1|513_bp ctcgctgctttccagacccaccagcctttcagctcccacacacaccacgcttcctgtgcc tctgaacagccgtgcaggcggttctcgctgcctgcggcctgcttccaccagctgatagaa gatcaaaccagccacaacattcccttcagccaaagcctaatccagaggaaggcactaact ctattcaattctatgaagtataagagaggtgaggaaacagcagaagaatggttggaagct agcagaggttggcttataaggtttaagggaaaaaggcattttcacgacataaaagtgcaa ggtgaagcagcaagtgctgatgtagaagctgcagcaagttctccagaagatctagctaag atcattgatgaaggtggcttcactaaacaacagattttcagtgtagaaattgacacagcc actccaaccctcagcaaccaccaccctgatgagtcagcagccatcaacatcaaggcaaga ccttccaccagcaaaaagattatgatttgctga >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_2|178_aa MIVHVTNRDIIYQIAYARIEGEIIVCAAYAHELPKYDVKVGLTNYAAAYCTGLLLACSLL NRFGMDNIYDGQMEVTGDEYNVENIDGQPGAFTCYLDAGLARTITGNKVFGTLKGTVDRS LSIPHSTKRFPGYDSESNAEVHQKRIMGQNVADYMRYLMEQDEDAYKKQFSQYIKKTA >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_2|537_bp atgatagttcatgtaacaaacagagatatcatttatcagattgcttatgcccgtatagag ggggaaattatagtctgcgcggcatatgcacacgaactgccaaaatatgatgtgaaggtt ggcctaacaaattatgctgcagcatattgtactggcctgctgctggcctgcagtcttctc aataggtttggcatggacaacatctatgacggccaaatggaggtgactggtgatgaatac aatgtggaaaacattgatggtcagccaggtgcctttacctgctatttggatgcaggcctt gccagaactatcactggcaataaagtttttggcacactgaagggaactgtggatagaagc ttgtctatccctcacagtaccaaacgattccctggttatgattctgaaagcaatgcagaa gtacaccagaagcgcatcatgggccagaatgttgcagattacatgcgctacttaatggaa caagatgaagatgcttacaagaaacagttctctcaatacataaaaaaaacagcataa >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_3|176_aa MKVLLCDLLLLSLFSSVFSSCQRDCLTCQEKLHPALDSFDLEVCILECEEKVFPSPLWTP CTKVMARSSWQLSPAAPEHVAAALYQPRASEMQHLRRMPRVRSLFQEQEEPEPGMEEAGE MEQKQLQKRFGGFTGARKSARKLANQKRFSEFMRQYLVLSMQSSQRRRTLHQNGNV >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_3|531_bp atgaaagtcctgctttgtgacctgctgctgctcagtctcttctccagtgtgttcagcagt tgtcagagggactgtctcacatgccaggagaagctccacccagccctggacagcttcgac ctggaggtgtgcatcctcgagtgtgaagagaaggtcttccccagccccctctggactcca tgcaccaaggtcatggccaggagctcttggcagctcagccctgccgccccagagcatgtg gcggctgctctctaccagccgagagcttcggagatgcagcatctgcggcgaatgccccga gtccggagcttgttccaggagcaggaagagcccgagcctggcatggaggaggctggtgag atggagcagaagcagctgcagaagagatttgggggcttcaccggggcccggaagtcggcc aggaagttggccaatcagaagcggttcagtgagtttatgaggcaatacttggtcctgagc atgcagtccagccagcgccggcgcaccctgcaccagaatggtaatgtgtag >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_4|549_aa MASVLSRRLGKRSLLGARVLGPSASEGPSAAPPSEPLLEGAAPQPFTTSDDTPCQEQPKE VLKAPSTSGLQQVAFQPGQKVYVWYGGQECTGLVEQHSWMEGQVTVWLLEQKLQVCCRVE EVWLAELQGPCPQAPPLEPGAQALAYRPVSRNIDVPKRKSDAVEMDEMMAAMVLTSLSCS PVVQSPPGTEANFSASRAACDPWKESGDISDSGSSTTSGHWSGSSGVSTPSPPHPQASPK YLGDAFGSPQTDHGFETDPDPFLLDEPAPRKRKNSVKVMYKCLWPNCGKVLRSIVGIKRH VKALHLGDTVDSDQFKREEDFYYTEVQLKEESAAAAAAAAAGTPVPGTPTSEPAPTPSMT GLPLSALPPPLHKAQSSGPEHPGPESSLPSGALSKSAPGSFWHIQADHAYQALPSFQIPV SPHIYTSVSWAAAPSAACSLSPDFNAEERKGENVEAVWAGSEISRPLETRAIGGPPTQVR SRSLSFSEPQQPAPAMKSHLIVTSPPRAQSGARKARGEAKKCRKVYGIEHRDQWCTACRW KKACQRFLD >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_4|1650_bp atggcgagtgtcctgtcccgacgccttggaaagcggtccctcctgggagcccgggtgttg ggacccagtgcctcggaggggccctcggctgccccaccctcggagccactgctagaaggg gccgctccccagcctttcaccacctctgatgacaccccctgccaggagcagcccaaggaa gtccttaaggctcccagcacctcgggccttcagcaggtggcctttcagcctgggcagaag gtttatgtgtggtacgggggtcaagagtgcacaggactggtggagcagcacagctggatg gagggtcaggtgaccgtctggctgctggagcagaagctgcaggtctgctgcagggtggag gaggtgtggctggcagagctgcagggcccctgtccccaggcaccacccctggagcccgga gcccaggccctggcctacaggcccgtctccaggaacatcgatgtcccaaagaggaagtcg gacgcagtggaaatggatgagatgatggcggccatggtgctgacgtccctgtcctgcagc cctgttgtacagagtcctcccgggaccgaggccaacttctctgcttcccgtgcggcctgc gacccatggaaggagagtggtgacatctcggacagcggcagcagcactaccagcggtcac tggagtgggagcagtggtgtctccaccccctcgcccccccacccccaggccagccccaag tatttgggggatgcttttggttctccccaaactgatcatggctttgagaccgatcctgac cctttcctgctggacgaaccagctccacgaaaaagaaagaactctgtgaaggtgatgtac aagtgcctgtggccaaactgtggcaaagttctgcgctccattgtgggcatcaaacgacac gtcaaagccctccatctgggggacacagtggactctgatcagttcaagcgggaggaggat ttctactacacagaggtgcagctgaaggaggaatctgctgctgctgctgctgctgctgcc gcaggcaccccagtccctgggactcccacctccgagccagctcccacccccagcatgact ggcctgcctctgtctgctcttccaccacctctgcacaaagcccagtcctccggcccagaa catcctggcccggagtcctccctgccctcaggggctctcagcaagtcagctcctgggtcc ttctggcacattcaggcagatcatgcataccaggctctgccatccttccagatcccagtc tcaccacacatctacaccagtgtcagctgggctgctgccccctccgccgcctgctctctc tctccggattttaatgctgaggagaggaagggggagaatgtggaggctgtgtgggctggg tcagaaatctcaaggccactggagacaagggccataggtggacccccaactcaagtccgg agccggtcgctaagcttcagcgagccccagcagccagcacctgcgatgaaatctcatctg atcgtcacttctccaccccgggcccagagtggtgccaggaaagcccgaggggaggctaag aagtgccgcaaggtgtatggcatcgagcaccgggaccagtggtgcacggcctgccggtgg aagaaggcctgccagcgctttctggactga >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_5|327_aa MHLCYNVQLKNPTAQQFHFLSTYPRENSQVPAQLTSARHERQHAPGFPGSDGSARTASSA PGCSGRGPARPPGAARGPAPLRPPAEARLAPTWPACPPPRDTALTYAAAATTRPGFPPGK GAEQHASPCQPAPSGDTRDTPPPAPRASAPPHPPSTPPARLGYTPPAARRQSALNKCPSD SSRTCALEPDRVGDAATESFPGLGEVAETPSGTTRVRSPPPKQRCPDAPLGAQSAACTQP LATTYRVCAHVCTWELAMASPPGEGCSGFPDDFAAVAASASAASERCPAAGLSPLPQVPA QQSLLSFLPVSTEPDISSPLNAQAPAA >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_5|984_bp atgcatctttgttacaatgtacaactcaagaaccctacggcccagcagttccactttctc agtacctaccccagagaaaactcacaggtccccgcgcagctcacctcggcccgccacgag cggcagcacgcgcccggcttccccggctcggacggctcggctcggacggctagctcagct ccgggctgcagcggccgcggccctgcccgccctcccggcgcggcccgcggcccggctccg ctccggccgcccgccgaggctcggttggcgcccacctggccggcctgccctcctccccgg gacactgcgctcacgtacgcggccgcggccacaacccggcccggattcccgcccgggaag ggagcggagcagcacgccagcccctgccagccggcgcccagcggcgacactcgggacaca cccccgcccgccccccgcgcctccgcgcccccgcacccgcccagcacgcccccagcgcgc ctgggctacacacccccggccgcccgtagacagtcggcgctcaataagtgcccgagcgac tcctcgcgcacatgcgcacttgaacctgaccgagtgggcgatgctgccaccgagtcattt cctggattaggagaagtggcagagacgccctcagggaccacacgtgtcagatctccacct cccaagcagcgttgcccggacgcgccgctcggggcccagagcgctgcgtgcacgcagcct ctagcaaccacttaccgcgtgtgtgcgcatgtgtgtacctgggaactggctatggcctcg ccgcccggcgagggctgcagtgggttccccgacgactttgccgcggttgctgcttctgcc agcgcagcctcggagcgctgcccggctgcagggctctcacctcttccccaggtccctgct cagcagagcctcttaagcttcctgcctgtgtccacagaacccgacatctcaagccccctg aacgcacaggctcctgctgcataa >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_6|138_aa MGLQMEVKDGPKAFGGSDLDKEAEARALPPAAAGVEDRAGGGRRAEAAGCPRLEGRAGER KRSLSRNSHRAAEGLPGPSEPQDRRGPASSSALAVGGAARKPDAAAPWPMASRIFPAAAG AGLRHLQNFQEGAWRKST >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_6|417_bp atgggtttacagatggaggtgaaagatgggcccaaggcctttggtggatcagacttagac aaagaagctgaggcccgggcgcttccgccggccgcagccggggttgaggaccgagcagga gggggacgccgggctgaagccgcgggttgtccccgcctcgagggcagggccggggagcgc aaacgcagcctttccaggaactcgcaccgcgctgcagaggggctgccgggccccagcgag ccgcaggacagacgcggcccggcatcctcgtccgcactcgcagtggggggagcagcgcgg aaacccgacgccgccgctccctggccgatggcctctcgaatcttcccggcggcggccggc gcggggctgcgtcacctgcagaactttcaggaaggggcctggaggaaatcaacctga >gi568815590r:28248722_28461124|GENSCAN_predicted_peptide_7|198_aa VCWHWKNLAELDQLWMLKCLRFNWYINFSPTPFEQGIWKKHYIQMVKELHITKPKTPPKD GFVIADVQLVTSNSPEEKQSPLSAFRSSSSLRKKNNSGEKALPPWRSSDKHPTDIIRFNY LDNRDPMETVQQGAALGMLMDGMAIGNVHLPESRQLQQRRLHIRKGSFMLVSFVKANVEN GPNIYRYGSSGCELLYSH >gi568815590r:28248722_28461124|GENSCAN_predicted_CDS_7|597_bp gtgtgctggcattggaagaaccttgctgagctggaccagctctggatgctgaaatgttta cggtttaactggtacatcaatttctctccaactccctttgagcaggggatctggaagaag cactatattcaaatggtgaaagaacttcatattaccaagcctaagacacccccaaaggat ggatttgtaatcgctgacgttcaactagttacaagcaattctccagaggaaaaacagtcc cctttatcagcttttcggtcctcttcctctttaagaaagaagaataactcaggggagaaa gcacttccaccctggcgatcttctgataagcacccaacagatatcattcgttttaattac ctagacaaccgtgaccccatggagactgtccagcaaggggcagcactaggaatgctgatg gacggcatggccattggcaacgtccacctaccagaaagcagacagctgcagcagaggagg ctgcacattcgtaaagggtcattcatgttggtgtcgtttgtgaaagcaaacgttgaaaat ggcccaaatatctatcggtatggttcatctggatgtgaactgctctacagccattga