GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:21:37 Sequence gi568815589r:3125045_3495588 : 370544 bp : 36.22% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2201 2493 293 0 2 78 42 131 0.637 2.02 1.02 PlyA + 3300 3305 6 1.05 2.02 PlyA - 3778 3773 6 1.05 2.01 Sngl - 20634 19798 837 0 0 66 42 315 0.827 20.18 2.00 Prom - 22522 22483 40 -3.65 3.00 Prom + 22916 22955 40 -5.65 3.01 Init + 26346 26538 193 2 1 95 81 37 0.144 2.88 3.02 Intr + 49354 49465 112 2 1 82 93 47 0.010 3.12 3.03 Term + 55803 56430 628 0 1 84 43 280 0.018 15.94 3.04 PlyA + 58214 58219 6 1.05 4.05 PlyA - 58470 58465 6 1.05 4.04 Term - 61160 60979 182 0 2 74 54 149 0.984 6.89 4.03 Intr - 73658 73459 200 1 2 -9 80 173 0.135 4.87 4.02 Intr - 75978 75881 98 0 2 76 31 112 0.265 2.19 4.01 Init - 77227 77126 102 1 0 52 109 50 0.495 3.79 4.00 Prom - 90091 90052 40 -5.05 5.14 PlyA - 90098 90093 6 1.05 5.13 Term - 100236 99998 239 1 2 67 43 218 0.933 10.55 5.12 Intr - 103845 103803 43 0 1 48 116 43 0.004 -0.01 5.11 Intr - 123141 122988 154 2 1 58 94 176 0.828 14.35 5.10 Intr - 132155 131947 209 2 2 79 69 198 0.855 13.95 5.09 Intr - 138040 137891 150 1 0 39 66 155 0.725 7.84 5.08 Intr - 141261 141164 98 1 2 79 69 73 0.925 3.31 5.07 Intr - 145481 145327 155 1 2 29 94 173 0.982 10.79 5.06 Intr - 146074 145959 116 1 2 82 55 58 0.533 0.23 5.05 Intr - 150568 150456 113 2 2 62 107 80 0.503 6.48 5.04 Intr - 152417 152296 122 1 2 76 77 78 0.975 4.72 5.03 Intr - 163206 163087 120 2 0 5 69 126 0.635 1.09 5.02 Intr - 168214 168033 182 1 2 85 78 122 0.981 8.64 5.01 Init - 176570 176502 69 2 0 64 103 74 0.895 7.60 5.00 Prom - 179615 179576 40 -4.85 6.00 Prom + 179737 179776 40 -6.05 6.01 Init + 184868 184979 112 1 1 57 41 134 0.922 6.02 6.02 Intr + 184996 185114 119 2 2 16 92 114 0.405 3.86 6.03 Term + 188429 188665 237 1 0 79 54 338 0.950 24.68 6.04 PlyA + 189615 189620 6 1.05 7.00 Prom + 189785 189824 40 -6.15 7.01 Sngl + 191886 192713 828 2 0 86 42 191 0.705 9.82 7.02 PlyA + 194782 194787 6 1.05 8.05 PlyA - 195191 195186 6 1.05 8.04 Term - 199051 198938 114 1 0 116 38 22 0.524 -2.41 8.03 Intr - 205473 205215 259 2 1 72 98 194 0.768 15.34 8.02 Intr - 221720 221623 98 2 2 78 101 87 0.032 6.89 8.01 Init - 270544 270428 117 1 0 89 110 99 0.493 12.46 8.00 Prom - 271481 271442 40 -3.25 9.00 Prom + 287527 287566 40 -5.05 9.01 Sngl + 294851 295249 399 1 0 40 47 200 0.564 7.11 9.02 PlyA + 296353 296358 6 1.05 10.04 PlyA - 296365 296360 6 1.05 10.03 Term - 323472 323310 163 2 1 68 47 125 0.470 2.83 10.02 Intr - 331196 331012 185 2 2 82 56 92 0.201 3.06 10.01 Init - 332678 332670 9 2 0 72 99 1 0.405 0.25 10.00 Prom - 346154 346115 40 -5.65 11.02 PlyA - 347028 347023 6 1.05 11.01 Sngl - 350963 350703 261 2 0 58 44 195 0.766 7.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 56400 56233 168 0 0 91 57 258 0.811 20.58 S.002 Init - 61333 61291 43 1 1 92 92 39 0.877 5.33 S.003 Init - 103863 103803 61 0 1 55 116 47 0.974 5.66 S.004 Init + 110271 110352 82 2 1 48 94 109 0.915 8.68 S.005 Term + 113493 113608 116 1 2 81 32 104 0.949 1.85 S.006 Term - 121393 121346 48 1 0 104 43 50 0.822 -1.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_1|97_aa XNSSAKPWDLMEQTLKTFARSIMASLPFLLKYWNSLSVSSLLGYIAIGDYPLCSPDTSLE PEAPRLLSHIPLPFHLLLLLHVCNLIISPGVSSAFRL >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_1|294_bp nataactcctctgcaaaaccttgggatttgatggagcaaactctgaaaacctttgcgcgt agcatcatggccagtctaccatttctgctaaaatactggaactcgttgtctgtatcttca ctgttgggttatatcgctattggggattatcctctgtgctcaccggacacatccctagag ccagaggctcctagactcctatcccacattcctcttcctttccacctgctcctgctcctg catgtatgcaacttaattatttctcctggtgtttcctctgcatttaggctatag >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_2|278_aa MKKDKGGHYIMVKGLIQQEEVTILNIYAPNIGTPRIIKKVFRDLQRDLDSYTIAVGDFNT ALSILDRSTRQKINKDIQELNSALDQADLIDIYRTLHPKSTEYTIFSAPQRTYSKIDHII GSKTLLSKCRRREMITNSFSDHSAIKLELRIKKLTQNHTTTWKLNNLLVSDYWVNNEIKA EINKFCGMRTKYQNLWDTAKAMFRGIFIALNVHRRKQERSNVDTLTSQLKELEKQEQTNS KASRRQEMTKIRAELKEIETRKTLQKINESRGWLFLKD >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_2|837_bp atgaaaaaagacaaaggagggcattacataatggtaaagggattaattcaacaagaagag gtaactatcctaaatatatatgcacccaatataggaacacccagaatcataaagaaagtt tttagagacctacaaagagacttagactcctacacaatagcagtgggcgactttaacacc gcactgtcaatattagacagatcaacaagacagaaaattaacaaggatattcaggagttg aactcagctttagaccaagcagacctaatagacatctacagaactctccaccccaaatca acagaatatacgatcttctcagcaccacagcgcacttattctaaaattgatcacataatt ggaagcaaaacactcctcagcaaatgcagaagaagggaaatgataacaaacagtttctct gaccacagtgcaattaaattagaactcaggattaagaaactcactcaaaaccacacaact acatggaaactgaacaacctgctcgtgagtgactactgggtaaataatgaaattaaggca gaaataaataagttctgtgggatgagaacaaagtaccagaatctctgggacacagcaaaa gcaatgtttagagggatatttatagcactaaatgtccataggagaaagcaggaaagatct aatgtcgacaccctaacatcacaattaaaagaactagagaagcaagagcaaacaaattca aaagctagcagaagacaagaaatgactaagatcagagcagaactgaaggagatagagaca cgaaaaacccttcaaaaaatcaatgaatccaggggctggctttttttaaaagattaa >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_3|310_aa MDVPTEVMQAKPCTVTNLNIGRMPYVHSKTQEDRRLLRNHLEAISHSSPYFSWEAVSFLL PPSSVTSLFPVLTLKIKGRLTMLSTLYSQDSHTSSSSKWHSLPKLVLRARLPTATKFSES NKDTTGSIWTFSGQERQPGISLQRVAGDAHLLWDSRSAQPALLHARAGQHSAPRVHARRP DTRSLVHAQRAHTPALHARAMGAHTRIPTRAHTRAPITALARKRAATSEWDSACGGSARY FTISHLRVRLLPCWLHLQPQRFPAGAPELSRPKELLLRHKAQLHTVLQARGDPGRSRGRS HGPSASAPEP >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_3|933_bp atggatgtacccacagaggtcatgcaagcaaagccttgtactgtaactaatctcaatatt ggaagaatgccctatgttcattccaaaacacaagaagataggaggctcctgaggaaccat cttgaagccatctcccattcatctccctatttttcttgggaggctgtcagtttcctactg cctccaagttctgtaacctctctgtttccagtattaactctgaaaatcaagggccgacta accatgctgtctacactttactcacaagatagccatacctcttcttcttcgaagtggcat tcactgccgaagctggttctcagagctcgtctgcctacagctactaagttctctgagagc aataaagatacgacgggatccatctggacgttttccggacaggagaggcagcccggcatt tcccttcagagagttgctggggatgcacacctgctctgggacagccggagtgcgcaaccc gcacttctacacgcgcgcgccgggcaacatagcgcaccgcgggtacacgcacgccgaccg gacacacgcagtctcgtacacgcccaacgagcgcacacacccgcactccatgcacgcgcg atgggcgcacacactcgcatccctacacgtgcacacacccgcgctccaattaccgcgcta gcgcgcaaaagagccgcaacttccgagtgggactcggcatgcgggggcagtgcccggtac ttcaccatcagtcaccttcgcgtccgcctcctcccctgctggttgcacctccagccgcag agatttccggctggggcaccagaactgtccaggccgaaagagctgctcctccgtcacaaa gcgcagttgcacactgtactgcaggcgcgcggggacccgggccggagccggggtcgcagc catggtccaagcgcctctgctcctgagccctag >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_4|193_aa MSRPPSGVIAGEVDWRSGRCGIEVYHFHNNINLSLRPEGPRGDADGETVTPFLQPQLPKK GLLSLIKCSIEWGTKVCDKGSSRAEILGVSSQQHSLSGKSTWQNLSWCFLNSQGQCGKPS CTKYLRRCSGQLTVCKCNTDPVKGLLNKANSLLPGVSRVFFQLPNPHPPTLLRPQLELDP NPDCDTGVVQMLT >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_4|582_bp atgagcaggcctccaagtggtgtcatagctggagaggtggattggcggagtggcagatgt gggatcgaagtctatcactttcataacaacatcaacctctcgctcagacctgaaggaccc agaggggatgctgatggggaaacggtcactccattcctgcagccccaattacccaagaag gggcttctctcgctcatcaagtgctcgattgaatggggaaccaaggtctgtgacaaaggc agctcccgtgctgagattctaggcgtcagcagccagcagcactcgctgtcaggaaagagc acgtggcagaacctaagctggtgcttcctcaactcccaaggccaatgtggtaaaccatcc tgcacaaaatacctccgcaggtgttcaggacaactgacagtgtgtaaatgtaacactgac cctgtgaaggggctgctgaataaagccaactctcttttacctggtgtctctcgagtgttc ttccagctccccaacccacatccacccactctcctcagacctcagctggagctggacccc aaccctgattgtgacactggagttgtacagatgctcacttga >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_5|589_aa MAIETLQKSDGLSTHRSSLLNSHLQWLLDNYETAEGVSLPRSTLYNHYLRHCQEHKLDPV NAASFGKLIRSIFMGLRTRRLGTRGNSKYHYYGIRVKPDSPLNRLQEDMQYMAMRQQPMQ QKQRYKPMQKVDGVADGFTGSGQQTGTSVEQTVIAQSQHHQQFLDASRALPEFGEVEISS LPDGTTFEDIKSLQSLYREHCEAILDVVVNLQFSLIEKLWQTFWRYSPSTPTDGTTITES SNLSEIESRLPKAKLITLCKHESILKWMCNCDHGMYQALVEILIPDVLRPIPSALTQAIR NFAKSLEGWLSNAMNNIPQRMIQTKVAAVSAFAQTLRRYTSLNHLAQAARAVLQNTSQIN QMLSDLNRVDFANVQEQASWVCQCDDNMVQRLETDFKMTLQQQSTLEQWAAWLDNVMMQA LKPYEGRPSFPKAARQFLLKWSFYSSMVIRDLTLRSAASFGSFHLIRLLYDEYMFYLVEH RVAQATGETPIAVMGEFGDLNAVSPGNLDKDEGSEVESEMDEELDDSSEPQAKREKTELS QAFPVGCMQPVLETGVQPSLLNPIHSEHIVTSTQTIRQCSATGNTYTAV >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_5|1770_bp atggcgattgagacgctgcaaaagtctgacggtctgtccactcacagaagctctcttctc aacagccatctccagtggctgttggacaattatgagacagcagaaggagtgagccttccc agaagcactctgtacaaccactaccttcgacactgtcaggaacacaaactggacccagtc aatgctgcctcttttggaaaattaataagatcaatttttatggggctacgaaccaggaga ttgggcactagaggaaactccaaataccactactatgggattcgtgtcaagccagattcc cctcttaatcgtctgcaagaagacatgcagtatatggctatgagacaacaacccatgcaa cagaaacaaaggtacaagcctatgcagaaagtggatggggttgcagatggtttcacagga agtggtcaacagacaggcacatctgttgagcaaactgtaattgcccaaagccaacatcat caacagtttttagatgcatctcgagcacttccagagtttggagaagttgaaatctcttct ctgccagatggtactacctttgaggatatcaagtcactgcagagtctttatagagagcac tgtgaggcaatattggacgttgttgtgaatcttcaatttagcctgatagaaaaattgtgg caaacattctggcgctattctccctctactccaactgatggcactaccattaccgaatcg agcaatctgagtgaaatagaaagtcgacttccgaaagcaaagctgataactctgtgcaaa catgagtctatcctgaaatggatgtgtaactgtgaccatgggatgtaccaggctttggtg gagattctcatccccgacgtccttagacctattcctagtgccttgacccaagccattcga aattttgcaaaaagccttgaaggttggctttccaatgccatgaacaatattccacagaga atgatacaaaccaaggttgccgctgtaagtgcctttgcccagactctgcgaagatacacg tcgcttaatcacctggcccaggcagctcgtgcagtgcttcagaacacttcccaaatcaac cagatgcttagtgacctcaaccgtgtcgactttgccaatgtccaggagcaggcttcctgg gtgtgccagtgtgatgacaacatggttcagagactagaaacagacttcaagatgactctt cagcagcagagcaccctggagcagtgggctgcgtggcttgacaatgtgatgatgcaagca ctgaaaccctatgaaggaagacccagttttcctaaagccgccaggcagtttctgctaaaa tggtctttctacagctcaatggttattcgggacttaaccttacgcagtgctgctagcttt ggctccttccacctgatccgtctactctacgacgaatatatgttttacttagtagaacat cgtgttgctcaggcaacaggagagactcctatagcagtcatgggcgagtttggtgattta aatgccgtgtctcctggaaatctggataaagatgaaggcagtgaagtagaaagtgaaatg gatgaagaactggatgactcttcagagcctcaagccaaaagagagaaaacagagctgagc caggcatttccagtgggctgcatgcagcctgttctcgagactggcgtgcaaccaagcctc ctgaatccaattcacagcgagcacattgtcacaagtactcagactatcagacagtgcagc gctacaggaaatacctacactgcagtctaa >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_6|155_aa MIPDLERLKVEDHWQNKRTEWGEQGGTKEALCDRLLEVLSPAKSSVKPMPLADASSTSNR DWLEHRDHLGAVDNAKKERSSPPATEQSWMENNFDELREEDFRRSAITNFSELKEDVRTH HKEAKNLEKRLDEWLTRINSLEKSLNDLMELKTMA >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_6|468_bp atgatcccagacctggaaagactgaaggtggaggatcactggcagaataaaaggacagag tggggagaacaaggtggtacaaaagaggctttgtgtgaccgtcttttagaggtgctttcc cctgcaaaatcttcagtaaaacctatgccacttgcagatgcttctagtacaagtaatcga gactggctggagcacagggatcatttaggggcagttgataatgccaagaaggaacgcagc tccccgccagcaacggaacaaagctggatggagaataattttgacgagttgagagaagaa gacttcagacgatcggcaataacaaacttctctgagctaaaggaggatgttcgaacccat cacaaagaagctaaaaaccttgaaaaaagattagacgaatggctaactagaataaacagt ctagagaaatccttaaatgacctgatggagctgaaaaccatggcatga >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_7|275_aa MAILPKVIYRFNAIPIKLLLTFFTELEKTTLKFIWNQKRARIAQTILSQKNKAGGIMLPD FKLYYKATVTKTAWYWNQNRDIDQWNRIEPSEIIPHIYNHLIFDKFDKSKKWGKDSLFNK WCWENWLATCRKLKLDLFLTPYAKINSRWIKDLNVRPKTVKALEENLGNTIQDIGMGKDF MTKTPKAMATKAKIDKWVLIKLKSFCTAKETTIRVNRQPTEWEKIFTIYPSDKGLISRIY KELKTNLQDKIKQPHQKVGKGYEQTLLKRRHLCSQ >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_7|828_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaaactactactg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcattgcccagacaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggaaccaaaacaga gatatagaccaatggaacagaatagagccctcagaaataataccacacatctataaccat ctgatctttgacaaatttgacaaaagcaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccacatgtagaaagctgaaactggatctcttccttaca ccttatgcaaaaattaattcaagatggattaaagacttaaatgttagacctaaaactgta aaagccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggttctaata aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagacaacctaca gaatgggagaaaatttttacaatctacccatctgacaaagggctaatatccagaatctac aaagaacttaaaacaaatttacaagacaaaatcaaacaaccccatcaaaaagtgggcaaa ggatatgaacagacacttctcaaaagaagacatttatgcagccaatag >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_8|195_aa MQTSETGSDTGSTVTLQTSVASQAAVPTQVVQQVPVQQQVQQVQTVQQVQHVYPAQVQYV EGSDTVYTNGAIRTTTYPYTETQMYSQNTGGNYFDTQGSSAQVTTVVSSHSMVGTGGIQM GVTGGQLISSSGGTYLIGNSMENSGHSVTHTTRASPATANSVPEMKLLRQITMNRLFPTL NHKLTEGSGPHFQSI >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_8|588_bp atgcagacatcagagactgggtcggacacaggctcgacagtgaccttacaaacatctgtg gctagtcaagcagcagtgcctacgcaggtggtacagcaagtaccagtacaacaacaggta cagcaggtacagactgtgcagcaggtacaacatgtctatcccgctcaggtgcagtatgtg gaaggaagcgatactgtctataccaatggagcaatccgaacaacaacgtatccttacaca gagacacagatgtacagccaaaatactggagggaattactttgatactcaagggagttcc gcccaggtgactaccgtggtctcatcccacagtatggtgggcactggtgggattcagatg ggcgtcacaggaggacaactcatcagcagctctggaggaacctatctgatcggcaactca atggagaattctggtcactcagtgacacacacaactcgggcctccccagcgacagccaac tcagtccctgaaatgaaactcctcagacagattaccatgaaccgtctgtttcctacacta aatcataaattgactgaaggttctgggcctcattttcaaagcatttaa >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_9|132_aa MQNWLTWKANRKIPTQPLHVRNQSAFSPPPTALKIVRQFVIVAREGTGGRLAKCGRKNVG ARSTTCSSELPPSLRPSFSEPAYIKALTMCAAFFQIFALKHTSQQLSPLGRMIKARKALI HRLSTNRTFYGS >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_9|399_bp atgcagaactggctcacatggaaggcaaaccgtaaaattccaacacaacctctgcatgtg agaaatcaatcagctttttctcctcctcctaccgcactcaaaattgtaaggcagtttgtc atcgtcgcacgtgaaggcacaggagggagactagctaaatgtggcagaaagaacgtcggg gcccgcagcaccacttgttcctcagaactgcctcccagcctccgaccatccttcagtgag ccagcgtacattaaagctctcaccatgtgtgctgccttcttccagatttttgcactgaag cacaccagtcagcagctttcaccacttgggagaatgatcaaggccaggaaagcactcata cacaggctgagtactaacaggacattctacggttcatga >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_10|118_aa MILCHGLNVSPPKVRCRNLRANVILFEHGIGHKVMRAPSLVNGSKTPIKTALHSLQERAL LPSAIRKRNMWVVISCCIVGGPQNGQRTHGDTADVANGFGDVKVEMELFGPFNYRLTL >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_10|357_bp atgattctctgccatggtttgaatgtgtctcctccaaaagtgagatgtagaaacttaagg gccaatgtgatactattcgagcatgggattggtcataaggtcatgagggctccttccctt gtgaatgggagtaaaacccctataaaaacagctttacacagccttcaggagcgtgctctt ctgccttctgccataagaaaaagaaatatgtgggttgtaataagctgctgtattgtggga gggcctcagaatggacagaggactcatggtgatacagctgatgtagcaaatggatttggg gatgtaaaggtcgaaatggaattatttggtccttttaattaccgcttaacactttga >gi568815589r:3125045_3495588|GENSCAN_predicted_peptide_11|86_aa MPGQGHQRAPWSSGNASVWEDACCQADCGLVVASVRRENTYYLADWEGESPFPQGSLEKT LYSTSCGGPDISQACPQLSGGLTVSL >gi568815589r:3125045_3495588|GENSCAN_predicted_CDS_11|261_bp atgcctggacagggccaccagagagctccttggtctagcggtaacgccagcgtctgggaa gatgcctgttgccaagcagactgtggtctagtggtagcgtcagtgcgaagggaaaacacc tactacttagcagactgggaaggggagtctccctttccccaggggagtttagagaagaca ctctactccacctcttgtggagggcctgacatcagtcaagcctgcccacagttatctgga ggcctgactgtctccctgtga