GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:28:56 Sequence gi568815587r:8837619_9038161 : 200543 bp : 43.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 40 35 6 1.05 1.03 Term - 19670 19515 156 2 0 81 44 87 0.337 1.53 1.02 Intr - 33230 33148 83 0 2 82 61 20 0.291 -1.94 1.01 Init - 33447 33336 112 0 1 48 99 145 0.843 9.97 1.00 Prom - 41414 41375 40 -6.66 2.00 Prom + 54078 54117 40 -5.06 2.01 Init + 56008 57134 1127 0 2 60 53 296 0.603 16.97 2.02 Term + 57566 57638 73 2 1 72 50 38 0.407 -4.22 2.03 PlyA + 58626 58631 6 1.05 3.00 Prom + 69302 69341 40 -4.66 3.01 Init + 73832 74053 222 1 0 103 78 136 0.630 12.67 3.02 Intr + 77208 77312 105 2 0 53 111 41 0.890 3.31 3.03 Intr + 81719 81856 138 1 0 128 -14 153 0.015 9.76 3.04 Intr + 88027 88161 135 0 0 94 33 75 0.002 3.36 3.05 Intr + 94428 94544 117 2 0 105 105 33 0.440 7.36 3.06 Term + 95160 95234 75 2 0 60 55 71 0.254 -1.16 3.07 PlyA + 96994 96999 6 1.05 4.03 PlyA - 98997 98992 6 -0.45 4.02 Term - 100555 99998 558 1 0 73 44 464 0.339 34.85 4.01 Init - 107771 107541 231 2 0 63 47 86 0.170 0.26 4.00 Prom - 108261 108222 40 -5.56 5.06 PlyA - 108420 108415 6 1.05 5.05 Term - 110857 110702 156 1 0 58 32 194 0.992 8.73 5.04 Intr - 115719 115585 135 0 0 54 98 19 0.525 0.26 5.03 Intr - 118680 118572 109 2 1 79 93 93 0.884 9.19 5.02 Intr - 124565 124474 92 2 2 70 111 4 0.831 -0.31 5.01 Init - 126695 126591 105 2 0 98 113 274 0.999 29.12 5.00 Prom - 133110 133071 40 -7.36 6.17 PlyA - 133154 133149 6 1.05 6.16 Term - 134968 134905 64 0 1 102 42 72 0.220 1.36 6.15 Intr - 146351 146257 95 2 2 43 89 103 0.570 4.66 6.14 Intr - 146506 146454 53 2 2 88 116 26 0.996 4.03 6.13 Intr - 148232 148093 140 1 2 45 99 101 0.997 7.01 6.12 Intr - 150012 149930 83 0 2 88 98 18 0.975 1.24 6.11 Intr - 150643 150500 144 1 0 55 95 101 0.447 7.98 6.10 Intr - 166304 166144 161 0 2 34 89 366 0.361 31.01 6.09 Intr - 166953 166731 223 0 1 68 99 149 0.105 11.70 6.08 Intr - 182453 182327 127 1 1 97 28 27 0.002 -1.72 6.07 Intr - 183579 183449 131 0 2 87 7 106 0.177 1.89 6.06 Intr - 184337 184258 80 0 2 46 113 80 0.914 5.57 6.05 Intr - 185138 185094 45 0 0 80 107 9 0.518 0.38 6.04 Intr - 188236 188084 153 2 0 92 103 192 0.998 21.14 6.03 Intr - 189943 189746 198 2 0 80 38 329 0.980 26.42 6.02 Intr - 193307 193140 168 0 0 92 105 73 0.988 9.32 6.01 Intr - 196178 196008 171 0 0 49 94 54 0.205 2.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 81719 81862 144 1 0 128 41 150 0.983 12.21 S.002 Init + 180254 180314 61 1 1 91 80 100 0.890 8.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:8837619_9038161|GENSCAN_predicted_peptide_1|116_aa MPARRPPSLQERLLLRAARLPARPHPGAAPGPAGERRAVSSSVSTRVGAAAGAEERGWWG PPSRARFVVVFGEDDMKLSGNMCILSFLIEFGDKKEETWYFALILYAAINDILVSR >gi568815587r:8837619_9038161|GENSCAN_predicted_CDS_1|351_bp atgcctgcccgccgcccgccctctctgcaggagcggctcctcctccgggccgcgcggctc ccggcgagaccccatccaggcgccgcgcccggcccggctggggaacgcagagcggtgtcc agctccgtgagtacgcgcgtgggggccgccgcaggcgcagaggagcggggctggtggggt ccgccctcccgggcgagatttgtggtggtttttggagaagatgacatgaaattatcaggc aatatgtgtatactcagctttctaatagagtttggggataagaaggaggagacctggtat tttgccttaattctgtatgcggctataaatgacattttagtatccagatga >gi568815587r:8837619_9038161|GENSCAN_predicted_peptide_2|399_aa MSELPFTTASKRIKYLVIQLTRDVKDLFKENYKPLLNETKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNAIPMKLPTTFFTELEKTTVKFIWNQKRAHIAKSILSQKNKAGGIMLP DFKLYYKATVTKTAWYWYQNRDLDQWNRTEPSEIIPDIYNHLIFDEPDKNKKWGKDSLFN KWCWENLLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGIGKD FISKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRI YNELQQIYKKKTNNPIKKWAKDMNRHFSKEDIYAANRHIKKCSPSLAIREMQIKTTMRYH LTPIRMAVIKKSGNNRWEVNNENTWTQEGEHHTPGPVVG >gi568815587r:8837619_9038161|GENSCAN_predicted_CDS_2|1200_bp atgagtgaactcccattcacaactgcttcaaagagaataaaatacctagtaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccgttgctcaacgaaacaaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtg aaaatggccatactacccaaggtaatttatagattcaatgccatccccatgaagctacca acgactttcttcacagaattggaaaaaactactgtaaagttcatatggaaccaaaaaaga gcccacattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatctagaccaatggaacagaacagagccctcagaaataataccagacatctacaac catctgatctttgacgaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaacttgctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaatgttcgacctaaaacc ataaaaaccctagaagaaaacttaggcaataccattcaggacataggcataggcaaggac ttcatatctaaaacaccaaaagcaatggcaacaaaagcgaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaaaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggca aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatcaaa aaatgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat ctcacaccaattagaatggcggtcattaaaaagtcaggaaacaacaggtgggaagtgaac aatgagaacacttggacacaggaaggggaacatcacacaccagggcctgttgtggggtag >gi568815587r:8837619_9038161|GENSCAN_predicted_peptide_3|263_aa MDNCLAAAALNGVDRRSLQRSARLALEVLERAKRRAVDWHALERPKGCMGVLAREAPHLE KQPAAGPQRVLPGEKYYSSVPEEGGATHVYRYHRGESKLHMCLDIGNGQAENISKDLYIE VYPGTYSVTVGSNDLTKKTHVVAVDSGQSVDLVFPEEDEEEETARGACIATFSSLGPSRG KSCAVGSNSGSSLAMTSEVLEASLSTKVYHQQPRTKEKEKGIHFPQRGRDRYLARGLCPP CSEETAATRAQDQAALLQPTRAP >gi568815587r:8837619_9038161|GENSCAN_predicted_CDS_3|792_bp atggacaactgtttggcggccgcagcgctgaatggggtggaccgacgttccctgcagcgt tcagcaaggctggctctagaagtgctggagagggccaagaggagggcggtggactggcat gccctggagcgtcccaaaggctgcatgggggtccttgcccgggaggcgccccacctagag aaacagccggcagccggcccgcagcgcgttctcccgggagagaaatattattcatctgtg ccagaggaaggaggggcaacccatgtctatcgttatcacagaggcgagtcgaagctgcac atgtgcttggacatagggaatggtcaggctgagaacatctctaaggacctctacatagaa gtatatccagggacctattctgtcactgtgggctcaaatgacttaaccaagaagactcat gtggtagcagttgattctggacaaagcgtggacctggtcttccctgaggaggatgaggag gaagaaacagccaggggagcgtgcattgctactttctcctctttaggaccttccaggggc aaaagctgtgctgtgggctccaactctggaagttctctggccatgacttctgaggtcctg gaagcaagtctgtccaccaaggtctaccaccaacagccaagaacaaaggaaaaagaaaag ggcatccacttcccccagaggggcagagacaggtaccttgcaaggggcttgtgcccgccc tgttcagaggagacagctgccaccagggcccaagatcaagccgcactgctgcagcccacc agagccccgtga >gi568815587r:8837619_9038161|GENSCAN_predicted_peptide_4|262_aa MGKSPGHLTKEEIQMANKHMKRCSTSYVIREMQTETTMGYQYTPIRMTKIYDTDNTKCSQ GCGATGTFFYCWWEYKMVKEEMMDNRGNSSLPDKLPIFPDSARLPLTRSFYLEPMVTFHV HPEAPVSSPYSEELPRLPFPSDSLILGNYSEPCPFSFPMPYPNYRGCEYSYGPAFTRKRN ERERQRVKCVNEGYAQLRHHLPEEYLEKRLSKVETLRAAIKYINYLQSLLYPDKAETKNN PGKVSSMIATTSHHADPMFRIV >gi568815587r:8837619_9038161|GENSCAN_predicted_CDS_4|789_bp atgggcaaaagtcctggacacctcaccaaagaggagatacagatggcaaataagcatatg aaaagatgttccacatcatatgtcatcagggaaatgcaaactgaaacaacaatgggatac caatacacacctattagaatgaccaaaatctacgacactgataacaccaaatgctctcaa ggatgtggagcaacaggaactttcttttattgctggtgggaatacaaaatggttaaagag gaaatgatggacaacagaggcaactctagtctacctgacaaacttcctatcttccctgat tctgcccgcttgccactgaccaggtccttctatctggagcccatggtcactttccacgtg cacccagaggccccggtgtcatccccttactctgaggagctgccacggctgccttttccc agcgactctcttatcctgggaaattacagtgaaccctgccccttctctttcccgatgcct tatccaaattacagagggtgcgagtactcctacgggccagccttcacccggaaaaggaat gagcgggaaaggcagcgggtgaaatgtgtcaatgaaggctacgcccagctccgccatcat ctgccagaggagtatttggagaagcgactcagcaaagtggaaaccctcagagctgcgatc aagtacattaactacctgcagtctcttctgtaccctgataaagctgagaccaagaataac cctggaaaagtttcctccatgatagcaaccaccagccaccatgctgaccctatgttcaga attgtttga >gi568815587r:8837619_9038161|GENSCAN_predicted_peptide_5|198_aa MATLWGGLLRLGSLLSLSCLALSVLLLAQLSDAAKNFEDVRCKCICPPYKENSGHIYNKN ISQKDCDCLHVVEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTIIIYLSILGLLLLYMV YLTLVEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSRSRANVLNKVEYAQQRWK LQVQEQRKSVFDRHVVLS >gi568815587r:8837619_9038161|GENSCAN_predicted_CDS_5|597_bp atggcgaccctgtggggaggccttcttcggcttggctccttgctcagcctgtcgtgcctg gcgctttccgtgctgctgctggcgcagctgtcagacgccgccaagaatttcgaggatgtc agatgtaaatgtatctgccctccctataaagaaaattctgggcatatttataataagaac atatctcagaaagattgtgattgccttcatgttgtggagcccatgcctgtgcgggggcct gatgtagaagcatactgtctacgctgtgaatgcaaatatgaagaaagaagctctgtcaca atcaaggttaccattataatttatctctccattttgggccttctacttctgtacatggta tatcttactctggttgagcccatactgaagaggcgcctctttggacatgcacagttgata cagagtgatgatgatattggggatcaccagccttttgcaaatgcacacgatgtgctagcc cgctcccgcagtcgagccaacgtgctgaacaaggtagaatatgcacagcagcgctggaag cttcaagtccaagagcagcgaaagtctgtctttgaccggcatgttgtcctcagctaa >gi568815587r:8837619_9038161|GENSCAN_predicted_peptide_6|678_aa XSCRAGTYYDGARERCILCPNGTFQNEEGQMTCEPCPRPGNSGALKTPEAWNMSECGGLC QPGEYSADGFAPCQLCALGTFQPEAGRTSCFPCGGGLATKHQGATSFQDCETRDRRCGGE LGDFTGYIESPNYPGNYPANTECTWTINPPPKRRILIVVPEIFLPIEDDCGDYLVMRKTS SSNSVTTYETCQTYERPIAFTSRSKKLWIQFKSNEGNSARGFQVPYVTYDGQREEEKQRS LREVTEDYQELIEDIVRDGRLYASENHQEILKDKKLIKALFDVLAHPQNYFKYTAQESRE MFPRSFIRLLRSKVSSLKCERRTLLRGHEQPLEGWSHSLTVTALPETPPECHTIISLWSR GPVGGDRPEVGEIRSAPNLGGSRQSSGPGRWTLEPRLAAWRCVSEKPSSGAGGGTRGMAR LSVIPGSATAWTGLLTEGGRKETDMREAASLRQQRRMKQAVQFIHKDSADLLPLDGLKKL GSSKDMRRLMETNLSKLRSGPRVPWASKTNKLNQAKSEGLKKSEEDDMILVSCQCAGKDV KALVDTGCLYNLISLACVDRLGLKEHVKSHKHEGEKLSLPRHLKVVGQIEHLVITLGSLR LDCPAAVVDDNEKNLSLGLQTLRSLKCIINLDKHRLIMGKTDKEEIPFVETVSLNEDNPP ITGPGDGVDDVLAPHYYF >gi568815587r:8837619_9038161|GENSCAN_predicted_CDS_6|2037_bp ntcagttgcagggctgggacctattatgatggagcacgagaacgctgcattttatgtcca aatggaaccttccaaaatgaggaaggacaaatgacttgtgaaccatgcccaagaccagga aattctggggccctgaagaccccagaagcttggaatatgtctgaatgtggaggtctgtgt caacctggtgaatattctgcagatggctttgcaccttgccagctctgtgccctgggcacg ttccagcctgaagctggtcgaacttcctgcttcccctgtggaggaggccttgccaccaaa catcagggagctacttcctttcaggactgtgaaaccagagacagaagatgtggaggggag ctgggagatttcactgggtacattgaatccccaaactacccaggcaattacccagccaac accgagtgtacgtggaccatcaacccaccccccaagcgccgcatcctgatcgtggtccct gagatcttcctgcccatagaggacgactgtggggactatctggtgatgcggaaaacctct tcatccaattctgtgacaacatatgaaacctgccagacctacgaacgccccatcgccttc acctccaggtcaaagaagctgtggattcagttcaagtccaatgaagggaacagcgctaga gggttccaggtcccatacgtgacatatgatggtcagagagaagaagaaaagcaaagaagc ctcagagaagtaaccgaggactaccaggaactcattgaagacatagttcgagatggcagg ctctatgcatctgagaaccatcaggaaatacttaaggataagaaacttatcaaggctctg tttgatgtcctggcccatccccagaactatttcaagtacacagcccaggagtcccgagag atgtttccaagatcgttcatccgattgctacgttccaaagtgtccagtctcaaatgtgag agacggactctgctccgtggacatgagcagccccttgaagggtggtctcattctctaaca gtaacagcactgccagaaacaccccctgaatgccacacgatcatctcactttggtcacgt ggcccggtgggtggggaccgacctgaagttggagaaatccggagcgctcccaacctcgga gggagtcgccagtcctccgggcccgggcggtggaccctggagccccggctggcggcgtgg aggtgcgtttctgagaagccgagcagcggcgcgggcggcgggactcgaggcatggcccgg ctgtcggtgatccccgggtcggccacggcgtggacagggctcctcactgagggcggccgc aaggagaccgacatgcgggaggcggcgtcactgcgacagcagcgccggatgaagcaggcg gtgcagttcatccacaaggactccgccgacctgctgcccctggacggcctcaagaagctg ggctcgtccaaggacatgaggcgcctcatggaaaccaacctgtctaagctccgaagcggt ccccgtgtcccttgggcctctaagacgaacaaactcaatcaggctaagtctgaggggcta aagaagtctgaggaggatgacatgattttggtttcttgccagtgtgctggaaaggatgtg aaagccttggttgacacaggctgcctatataatctcatctctttggcctgtgtggacaga ttgggactcaaggagcatgtcaaatcccacaagcatgaaggagaaaagctttctctaccc cggcatctcaaagtagtgggccagattgagcacctagtgatcacactgggctccctccgc ctggactgcccagcagctgtggttgatgacaatgagaaaaacttgtcccttggtctacag actctccgatctctgaagtgcatcataaacttggataagcaccggctgatcatggggaag acagacaaggaagaaatcccttttgtggagacagtctctttgaatgaagacaatcctcct attactggtcctggagatggggtagatgatgttcttgctcctcattactacttctag