GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:14:01 Sequence gi568815582f:85701206_85905112 : 203907 bp : 50.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 358 353 6 1.05 1.10 Term - 6861 6803 59 1 2 67 55 55 0.478 -2.05 1.09 Intr - 9102 8959 144 1 0 50 94 254 0.740 22.45 1.08 Intr - 34030 33985 46 1 1 130 116 31 0.864 8.08 1.07 Intr - 40484 40364 121 1 1 105 7 64 0.525 0.50 1.06 Intr - 40852 40530 323 1 2 6 23 235 0.586 3.46 1.05 Intr - 41932 41714 219 1 0 77 46 250 0.792 18.20 1.04 Intr - 49833 49721 113 1 2 137 101 34 0.491 9.70 1.03 Intr - 53176 53075 102 2 0 94 100 -11 0.355 0.85 1.02 Intr - 55049 54921 129 0 0 76 69 39 0.429 1.47 1.01 Init - 71844 71790 55 0 1 101 45 44 0.167 0.85 1.00 Prom - 72491 72452 40 -1.86 2.07 PlyA - 73263 73258 6 1.05 2.06 Term - 78662 78503 160 1 1 70 48 213 0.991 12.91 2.05 Intr - 79268 79174 95 2 2 106 81 84 0.978 8.26 2.04 Intr - 80121 80006 116 1 2 66 115 169 0.992 17.57 2.03 Intr - 85344 85323 22 0 1 48 96 42 0.659 -1.88 2.02 Intr - 87845 87735 111 2 0 106 55 132 0.815 12.28 2.01 Init - 98090 97860 231 2 0 90 107 430 0.857 43.36 2.00 Prom - 99383 99344 40 -8.66 3.00 Prom + 99798 99837 40 -6.56 3.01 Init + 100001 100073 73 1 1 38 115 1 0.030 -0.67 3.02 Intr + 103732 103899 168 2 0 46 15 181 0.471 6.52 3.03 Intr + 104528 104659 132 0 0 98 101 108 0.978 13.72 3.04 Term + 105533 105669 137 0 2 47 54 181 0.988 8.78 3.05 PlyA + 105774 105779 6 1.05 4.05 PlyA - 114599 114594 6 1.05 4.04 Term - 115351 115050 302 2 2 11 48 389 0.427 22.58 4.03 Intr - 118878 118718 161 2 2 57 105 111 0.533 9.33 4.02 Intr - 119024 118895 130 0 1 65 8 83 0.482 -2.25 4.01 Init - 120346 120289 58 1 1 58 94 71 0.829 5.12 4.00 Prom - 121420 121381 40 -7.36 5.00 Prom + 121746 121785 40 -5.66 5.01 Sngl + 122032 122751 720 0 0 36 37 2179 0.934 203.83 5.02 PlyA + 122793 122798 6 1.05 6.00 Prom + 123903 123942 40 -9.26 6.01 Init + 125283 125634 352 2 1 74 1 207 0.595 7.93 6.02 Intr + 126750 126888 139 1 1 113 68 -11 0.027 -1.08 6.03 Intr + 128009 128141 133 2 1 64 88 7 0.025 -1.05 6.04 Intr + 131494 131643 150 0 0 83 -18 162 0.553 5.46 6.05 Intr + 133951 134102 152 0 2 40 88 84 0.406 2.56 6.06 Intr + 134470 134610 141 1 0 34 91 80 0.572 2.37 6.07 Intr + 139352 139444 93 2 0 83 30 82 0.024 0.98 6.08 Intr + 152710 152824 115 1 1 40 121 51 0.335 4.05 6.09 Intr + 161083 161270 188 0 2 105 -18 109 0.248 0.49 6.10 Intr + 161452 161647 196 1 1 53 55 149 0.371 7.52 6.11 Term + 175277 175468 192 1 0 69 32 118 0.018 1.72 6.12 PlyA + 175727 175732 6 1.05 7.00 Prom + 176490 176529 40 -2.46 7.01 Init + 176613 176740 128 2 2 78 63 68 0.074 2.93 7.02 Intr + 184070 184257 188 2 2 88 37 87 0.050 2.93 7.03 Intr + 188677 188761 85 2 1 82 99 21 0.122 1.48 7.04 Intr + 189017 189041 25 2 1 91 121 12 0.238 2.83 7.05 Intr + 197935 198018 84 0 0 48 96 77 0.916 4.52 7.06 Intr + 201012 201068 57 2 0 93 75 45 0.649 2.78 7.07 Intr + 201330 201430 101 2 2 54 13 83 0.735 -3.69 7.08 Intr + 201810 201984 175 0 1 55 98 220 0.966 19.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 197556 197738 183 2 0 25 64 164 0.905 4.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_1|436_aa MGARWLTPVIPALWEAEAERQSEVHSIRLKSQAEAVPTSLGPTLMLLNKPVPARGPGIGL SGDSDHRQVWGHSPGLVSGVCRFSAPCSSSQVVGQGSSPRPPEPDRRRHHHQRPGGPRAP RARLRSEPTKKEAEAGPRSQREWTGLSTLPHKFTPNTTGHTVELGPRSGGALVVPSGNHT DDDDEDDDGKGGSNNDDSDKDDNSSEEPIGSELIHYCEDGTKQAVHERLPHDPDTSHQAQ CSESNFNMRFGRDKYPNYSNKNDENNDDDEDDGDNSVPGPVLEAGGSMPVWQRRTQNLGD RAPAPQTREMTCLSQSCACRLCAVLGGRQKQDDNEGKFRDTVGDEKTCCVKTLHACRGAD MGLKMSCLKGFQMCVSSSSSSHDEAPVLNDKHLDVPDIIITPPTPTGMMLPRDLGSTVWL DETGSCPDDGEIDPEA >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_1|1311_bp atgggggcgcggtggctcacgcctgtaatcccagcactttgggaggctgaggcagaaaga caaagtgaggttcattccatccgtctgaaatctcaggcagaggctgttccgacgtccctg ggccccacgctgatgctgctcaataaaccagtgcctgccaggggcccgggcatagggctc agtggggattctgatcacaggcaggtgtggggccacagccctggcctcgtctccggggtc tgcaggttcagtgctccctgcagctcttcccaggtggtgggacaaggctcgtcgccgcgg ccccccgagcccgaccgccgccgccaccaccaccagcgcccgggcgggcctcgcgcgcct cgggcgcggctccgcagtgagcccaccaagaaggaagcggaggctggccccagatctcaa agggagtggacagggctctccacactcccacataaattcacccccaacacaacaggccac actgtggaattggggccacgtagtgggggtgccctggtggtacctagtggcaatcacact gatgacgatgatgaggatgatgatggtaaaggtggcagcaacaatgatgacagtgacaag gatgacaacagtagtgaggaaccaatagggagcgaactcatccattactgtgaggacggc accaagcaagctgttcatgagagattgccccatgacccagacacctcccatcaggcccag tgttctgaatcaaatttcaacatgaggtttggaagggacaaatatccaaactatagcaat aagaatgatgagaacaatgatgatgatgaagatgatggtgacaactctgtcccaggccct gtcctggaggctggtggctcaatgcctgtttggcagaggaggacacagaacctgggggac cgagccccagcacctcagactcgggaaatgacctgcttgagccagtcctgtgcctgccgc ctttgtgctgtccttggagggaggcagaagcaggatgacaatgagggcaagttcagggac actgtgggagatgagaaaacttgctgtgtgaaaactctccacgcctgcagaggtgccgac atggggcttaagatgtcctgcctgaaaggctttcaaatgtgtgtcagcagcagcagcagc agccacgacgaggcccccgtcctgaacgacaagcacctggacgtgcccgacatcatcatc acgccccccacccccacgggcatgatgctgccgagggacttggggagcacagtctggctg gatgagacagggtcgtgcccagatgatggagaaatcgacccagaagcctga >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_2|244_aa MPGVKLTTQAYCKMVLHGAKYPHCAVNGLLVAEKQKPRKEHLPLGGPGAHHTLFVDCIPL FHGTLALAPMLEVALTLIDSWCKDHSYVIAGYYQANERVKDARYVDAEPSVGEELFEEDT SVLNGINWNVVAYFFDSPNQVAEKVASRIAEGFSDTALIMVDNTKFTMDCVAPTIHVYEH HENRWRCRDPHHDYCEDWPEAQRISASLLDSRSYETLVDFDNHLDDIRNDWTNPEINKAV LHLC >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_2|735_bp atgcccggggtgaaactgaccacccaggcctactgcaagatggtgctgcacggcgccaag tacccgcactgcgccgtcaacgggctcctggtggccgagaagcagaagccgcgtaaggag cacctccccctgggcggccccggcgcccaccacaccctcttcgtggactgcatccccctc ttccacggcaccctggccctcgcccccatgctggaggtggctctcaccctgattgattca tggtgcaaagatcatagctacgtgattgctggttattatcaagctaatgagcgagtaaag gatgccaggtacgtggatgcagaaccctctgtgggcgaggaattgtttgaggaggacacc tcagtgttgaatggcattaactggaatgtggtagcctatttctttgacagtccaaaccag gttgcagagaaggtggcctccagaatcgccgagggcttcagcgacactgcgctcatcatg gtagacaacaccaagtttacgatggactgcgtagcgcctacgatccacgtgtacgagcac catgagaacagatggcggtgcagagacccacaccatgactactgtgaagactggccagag gcacagaggatctcagcctcgctcctggacagccggtcctacgagacgctcgtggatttc gataaccacctggatgacattcggaatgactggacaaacccagagatcaataaagctgtc ctacacttgtgctag >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_3|169_aa MLATRVFSLVGKRAISTSVCVRAHESVVKSEDFSLPAYMDRRDHPLPEVAHVKHLSASQK ALKEKEKASWSSLSMDEKVELYRIKFKESFAEMNRGSNEWKTVVGGAMFFIGFTALVIMW QKHYVYGPLPQSFDKEWVAKQTKRMLDMKVNPIQGLASKWDYEKNEWKK >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_3|510_bp atgttggctaccagggtatttagcctagttggcaagcgagcaatttccacctctgtgtgt gtacgagctcatgaaagtgttgtgaagagcgaagacttttcgctcccagcttatatggat cggcgtgaccaccccttgccggaggtggcccatgtcaagcacctgtctgccagccagaag gcattgaaggagaaggagaaggcctcctggagcagcctctccatggatgagaaagtcgag ttgtatcgcattaagttcaaggagagctttgctgagatgaacaggggctcgaacgagtgg aagacggttgtgggcggtgccatgttcttcatcggtttcaccgcgctcgttatcatgtgg cagaagcactatgtgtacggccccctcccgcaaagctttgacaaagagtgggtggccaag cagaccaagaggatgctggacatgaaggtgaaccccatccagggcttagcctccaagtgg gactacgaaaagaacgagtggaagaagtga >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_4|216_aa MTANSAPRAGKVAQIQMRAGYFKNKVLVHCELGQSNARRYMQLKLGKGTFFQHCKPAADP YKKGCAKSESVQRILGGEIMFPALQSCDLLQDVPAGKDPVGGRARQLEERESQQLPVSTV GEAMSSKVSRDTLYEAVWEVLRGNQHKHRKFLETVESQISLKNYDPQKDKRFSGTIRLKS TPRPKFSVCVLEDQQHCDEAKAMGIPHMDITWTSRR >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_4|651_bp atgaccgccaactccgcgccgcgggccgggaaggtggcccagattcaaatgcgggcaggc tacttcaagaataaagtcctcgtgcactgtgagttagggcagagcaatgccagacgctac atgcaactgaaactgggaaaaggcacatttttccagcattgtaagcctgccgcagaccct tataaaaagggctgtgccaagtcagagtctgtgcaaagaattcttggtggtgaaattatg tttccagccttgcagagctgtgacctgctgcaagacgttcctgctggcaaggaccctgta ggaggcagagcccggcagctagaggagagggaatcacagcagctgcctgttagcacagtg ggagaagccatgagcagcaaagtctctcgcgacaccctgtacgaggcagtgtgggaagtc ctacgtgggaaccagcacaagcaccgcaagttcctagagacagtggagtcacagatcagc ttgaagaactatgacccccagaaggacaagcgcttctcaggcaccatcaggctgaagtcc actccccggcccaagttctccgtgtgtgtcctggaggaccagcagcactgtgacgaggcc aaggccatgggtatcccccacatggacatcacatggacatcgaggcgctga >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_5|239_aa MLKGTENVRYSSCSGGSCGLESSLGFTFIITITIIIIIAIITITIITIIIITINIIITIN ITITILIINITITITIITITVLIITITIITIIIITINIIITITIITIIITINITITILIN IITIITIIIITINIIITIIIITITIIITINITITITILIINITITINITTITITILIITT IITIITIIVIITIITIMIIITVTITIIITISIFILIVIATITIVIFIGLKTMVSVLFLL >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_5|720_bp atgctgaaggggacagaaaatgtgagatacagttcctgttcaggaggatcttgtggtcta gagagtagtcttggcttcaccttcatcatcaccatcaccatcatcattatcattgccatc ataaccatcaccatcatcaccatcatcatcatcaccatcaacatcatcatcaccatcaac atcaccatcaccatcctcatcatcaacatcaccatcaccatcaccatcatcaccatcact gtcctcatcatcaccatcaccatcatcaccatcatcatcatcaccatcaacatcatcatc accatcaccatcatcaccatcatcatcaccatcaacatcaccatcaccatcctcatcaac atcatcaccatcatcaccatcatcatcatcaccatcaacatcatcatcaccatcatcatc atcaccatcaccatcatcatcaccatcaacatcaccatcaccatcaccatcctcatcatc aacatcaccatcaccatcaacatcaccaccatcaccatcaccatcctcatcatcaccacc atcatcaccatcatcaccatcattgtcatcatcaccataatcaccatcatgatcatcatc actgtcaccatcaccattatcatcaccatcagcatcttcatcctcatcgtcattgccacc atcaccattgtcatcttcataggcttaaagacaatggtgtctgttctgtttctcctgtaa >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_6|616_aa MNCGRATPESRILQLIWKQQMYLRQQEGFWKHAGGMRREAAEGLCHRKPLLTNVKQKVAI DRSYRCLQLQQKKYRHVESIPTPGKEGRERNWENEGAPTVSEMFSFSEANTEKLSCTEQL PPNLCSWRRALVTGHETLREVFEIKIIITDSWGLEQREAGLFMLCFWLHVFLVVGVFLCP SVFPSEMGELGHESPLTKGSSHVCVAWAPPFIEEAGPPSGGASHAGHAWRTCQVPVGGRL SPGRSDSGTAFSDYTAEVECEKEKALPSELNSDPFQEPPVLLLYVLALCWVLFQVPAKCH LTQPSRPSGREAQASQQRLWGVCFMQLLLSNPELAKKLRVYDLASAQTFQSKRLCRPYLR FILQAAGPRHTACSQNSGWLQRKFAHVHLSRETGTSGPNGSYNLEVKCSRELLELMHDSC IQAAKVRLGPACCGSEEQTLEQERWMRRVRSTCSRKRRDEDSPGVEKNKEILKMFIGLAV EGWPHTLLTLLLPYLPWMHVSATCRLEAKDVGQHSWEQTEMAQATGPGCPLENAAWKVHA DVAAENQHPFDHEENYKPVLNKINEDTNKWKNIPRSWIGRINIMKMAILPKVIYRFNAIP SKLRMTFFTELDKTTL >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_6|1851_bp atgaactgcggcagggctacacccgagagtcgcatcctgcagttaatctggaagcagcag atgtacctgcgtcaacaggaaggattttggaaacatgcagggggcatgagaagagaggct gctgagggtctgtgccataggaagcctctactgacaaatgttaaacagaaggtggcaata gaccgttcctacagatgccttcagctgcagcagaagaaatacaggcatgtggaatccata ccaaccccgggaaaggagggcagggagcggaactgggagaatgaaggggcccccactgta tctgagatgttttcgttctctgaagcaaacacagaaaaactgtcctgcacggagcagctg ccgcccaatctctgctcgtggagacgagcgttggtcactggccatgaaacccttcgtgaa gtttttgaaataaaaataataataacagatagttggggactggagcagcgggaggctggg ctgtttatgctgtgtttttggttgcacgtgtttttggttgttggtgtttttctctgcccc tcggtatttccttccgaaatgggtgaactcgggcatgagtctccactcaccaaaggatcc tcccacgtctgtgtggcctgggcgccgcctttcattgaggaagctgggccgccatctgga ggcgcctcgcacgcgggacacgcatggaggacctgccaggtgcctgttggtggccgcctg tccccagggcgttctgactcagggacagctttcagcgactacacggcagaagtggagtgt gaaaaggaaaaagctttgccctctgagctcaactctgatccctttcaggaaccaccggtc ctcctgttgtacgtccttgccctgtgctgggtgctgttccaggtgcctgccaagtgtcat ctcacgcagccctccaggcctagcgggagggaggcccaggcatctcagcagaggctatgg ggtgtctgcttcatgcagctcctgctgtctaaccccgagctcgccaagaaactgcgtgtt tatgacctggcttctgcacagacatttcaaagcaagagactgtgcagaccctacctgcgc ttcatcctgcaagcagccgggccacgccacacagcctgctcccagaactcagggtggctg caacggaagttcgcacatgtacatttaagtcgagagactgggacttcaggccctaatggc agctataatttggaagtcaaatgcagcagggagctcttagagctgatgcatgattcctgc atacaggcagccaaggtgagattggggcctgcctgctgcggaagtgaggaacagacgttg gagcaggagagatggatgaggagagtccggtctacgtgttcaaggaagcgaagggatgaa gactcacctggtgtggagaagaacaaagaaatcctgaaaatgttcattggtttggcagtg gagggatggccacacacgctcctgaccctgctcttgccatatctgccctggatgcacgtt tcggccacgtgccgcctggaagcaaaggatgtaggtcagcattcatgggagcagacagag atggcccaggctacaggtcccggctgcccattggagaacgcagcttggaaagtacacgct gatgtggccgctgagaaccaacacccctttgaccatgaggagaactacaaaccagtgctc aacaaaataaatgaggacacaaacaaatggaagaacattccacgctcatggataggaaga atcaatatcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatcccc agcaagctacgaatgactttcttcacagaattggataaaactactttatag >gi568815582f:85701206_85905112|GENSCAN_predicted_peptide_7|281_aa MEYYAAIKKDEFMSFVGTRMKLETIILSKLSQGHQTLHVLTHRSEFGIPGFHLAAASPYP IHSWPWACTARLWLGDLSGYRFEEFTCRSPSLSRVYSHEHFEVALGTGGSHVTRDLIPLV ERLLKHSPLLSGLRCTCCTSAKAARPLAHRLERGSKRGNAGGETAAGRRQLVVGVAQEGS LDNNNDIWHTARTQYSIRECGASWASGSPVGSSDNGLLSEGTWMCDRNGGRRLRQWLIEQ IDSSMYPGLIWENEEKSMFRIPWKHAGKQDYNQEVDASIFK >gi568815582f:85701206_85905112|GENSCAN_predicted_CDS_7|843_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgtagggacacggatg aagctggaaaccatcattctcagcaaactatcacaaggacatcaaacactgcatgtactc actcataggtctgaatttggaattcctggcttccaccttgcagcagcctccccctacccc atccattcttggccctgggcctgtacagccaggctctggctgggtgatctttccggatac cgctttgaagagtttacctgccgatctccaagtttatcacgagtgtattcccatgaacat ttcgaggtagcactagggacaggtggctcccacgtcaccagggacctcattccacttgta gaaagattgttaaagcacagcccattattaagtggactgagatgcacctgctgcacatct gcaaaggccgcgcgaccgctggcgcatcgcctggagcgcggcagcaagcgtgggaacgcg ggcggcgagacggcggcaggacggcggcagttggtcgtaggagttgcccaggaaggaagt ttggataataataatgacatctggcatacggcacggacgcagtattctatcagggaatgt ggggccagctgggcatctggtagcccagttggttcttcggataatgggctgctcagtgaa ggaacctggatgtgtgaccggaatggtggtcggcggcttcgacagtggctgatcgagcag attgacagtagcatgtatccaggactgatttgggagaatgaggagaagagcatgttccgg atcccttggaaacacgctggcaagcaagattataatcaggaagtggatgcctccattttt aag