GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:27:32 Sequence gi568815583r:64266033_64481371 : 215339 bp : 41.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1946 1941 6 1.05 1.02 Term - 22704 22622 83 1 2 53 37 122 0.381 0.48 1.01 Init - 34467 34287 181 0 1 82 87 98 0.873 8.49 1.00 Prom - 42387 42348 40 -3.85 2.00 Prom + 46367 46406 40 -5.95 2.01 Init + 46659 47268 610 2 1 71 60 136 0.321 4.63 2.02 Intr + 50733 50876 144 1 0 84 55 73 0.360 2.93 2.03 Term + 54178 54260 83 2 2 105 37 104 0.454 3.88 2.04 PlyA + 54746 54751 6 1.05 3.00 Prom + 60309 60348 40 -7.05 3.01 Sngl + 61535 62392 858 1 0 88 43 678 0.929 58.88 3.02 PlyA + 62412 62417 6 1.05 4.00 Prom + 62947 62986 40 -6.15 4.01 Sngl + 63040 65283 2244 0 0 44 38 829 0.986 66.75 4.02 PlyA + 65410 65415 6 1.05 5.05 PlyA - 66203 66198 6 1.05 5.04 Term - 100043 99998 46 1 1 114 42 56 0.019 -0.70 5.03 Intr - 110873 110711 163 0 1 62 106 91 0.142 6.41 5.02 Intr - 115006 114926 81 2 0 108 116 78 0.553 11.29 5.01 Init - 115339 115294 46 1 1 78 100 45 0.454 5.60 5.00 Prom - 118422 118383 40 -2.85 6.00 Prom + 121447 121486 40 -7.05 6.01 Init + 121832 121932 101 1 2 101 33 170 0.739 10.95 6.02 Intr + 127914 128083 170 0 2 24 115 164 0.994 11.37 6.03 Intr + 129366 129499 134 1 2 56 113 108 0.998 9.54 6.04 Intr + 131574 131786 213 2 0 72 59 87 0.810 2.19 6.05 Intr + 134711 134789 79 1 1 106 74 73 0.964 5.91 6.06 Intr + 140298 140427 130 1 1 71 52 144 0.992 7.93 6.07 Intr + 143581 143796 216 1 0 86 111 317 0.995 30.50 6.08 Intr + 148053 148179 127 0 1 46 92 116 0.978 7.56 6.09 Intr + 152509 152696 188 0 2 46 116 130 0.960 9.17 6.10 Intr + 157999 158123 125 1 2 41 87 97 0.409 4.21 6.11 Intr + 159508 159599 92 2 2 49 109 55 0.370 2.49 6.12 Intr + 178974 179076 103 2 1 62 84 76 0.275 3.43 6.13 Term + 188965 189032 68 2 2 116 46 73 0.440 3.02 6.14 PlyA + 189775 189780 6 1.05 7.05 PlyA - 190173 190168 6 1.05 7.04 Term - 192458 192358 101 0 2 88 42 92 0.010 1.91 7.03 Intr - 194584 194467 118 1 1 30 105 151 0.010 10.12 7.02 Intr - 204401 204322 80 0 2 92 38 26 0.009 -3.65 7.01 Init - 211480 211420 61 1 1 36 110 71 0.513 5.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 107669 107466 204 2 0 43 48 289 0.902 13.34 S.002 Term + 194476 194588 113 2 2 50 55 229 0.835 13.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_1|87_aa MDHPSREKDERQRTTKPMAQRSAHCSRPSGSSSSSGVLMVGPNFRVGKKIGCGNFGELRL EGGKAFVKESGAYCQLLWAFKEQVMLD >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_1|264_bp atggaccatcctagtagggaaaaggatgaaagacaacggacaactaaacccatggcacaa aggagtgcacactgctctcgaccatctggctcctcatcgtcctctggggttcttatggtg ggacccaacttcagggttggcaagaagataggatgtgggaacttcggagagctcagatta gaaggcggcaaagcttttgtcaaagaaagtggagcttactgccagctgttgtgggctttc aaagagcaggtgatgctggattga >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_2|278_aa MGKDFMSKTPKAMATKAKIDKWDLIKLNSCCTAKETIIRVNRQPTEWEKIFAIYPSDKGL ISRIYKELKQIYKKKTNNPIKKWAKDINRHFSKEDIYAANKHEKMLIITVIREMQIKTTM RYHLTPVRMVIIKKSGNRCWRGCGEIGMLLHCWWECTLVQPLWKTVWQFLKDLELEIPFD PAIPLLGIYPKGYKSFYYEDTCTHSRHCMEKHHENLCPVLFHSEYPCIQPPVTYPLLAQS IMTLLRGPPYSYNSTETGTRQKNKCINEEVTTLRSTGY >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_2|837_bp atgggcaaagacttcatgtctaaaacgccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaacagctgctgcacagcaaaagaaactatcatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaatctatccatctgacaaagggcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaaacaaacaaccccatc aaaaagtgggcaaaggatataaacagacacttctcaaaagaagacatttatgcagccaac aaacatgaaaaaatgctcatcatcactgtcattagagaaatgcaaatcaaaaccacaatg agataccatctcacgccagttagaatggtgatcattaaaaagtcaggaaacagatgctgg agaggatgtggagaaataggaatgcttttacactgttggtgggagtgtacattagttcaa ccattgtggaagacagtgtggcaattcctcaaggatctagaactagaaataccatttgac ccagcaatcccattactgggtatatacccaaagggttataaatcattctactatgaagac acatgcacacattccagacattgtatggaaaagcatcatgaaaatctctgtcctgttctg ttccattctgaatacccgtgcatacagcccccagtcacgtaccccctgcttgctcaatcg atcatgaccctcttacgtggacccccttacagttataattccaccgaaacaggcacacgt caaaaaaacaaatgcattaatgaagaggttacaacgctcagaagtaccggatattga >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_3|285_aa MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSCMENDFDEVREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGQYSTFLKKRIFNPEFHIQPN >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_3|858_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctgtatggagaatgattttgac gaggtgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatacaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaagggaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtggggccaatattcaacattcttaaagaaaagaattttcaacccagaattt catatccagccaaactaa >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_4|747_aa MGDFNTLLSTLDRSTRQKVNKDTQELNSALHQADLTDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKTLLSKCKRTEITTNYLSDHSAIKLELRIKNLTQSHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDILTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYTNKLENLEEMDTFLDTYTLPRLD QEEVESLNRPVTGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD QVGFIPGMQGWFNIRKAINAIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSLLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILS QKNKAGGITLPDFKLYYKATVTKTAWY >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_4|2244_bp atgggagactttaacaccctactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaacagacatctacaga actctccatcccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaaactctcctcagcaaatgtaaaagaacagaaatt acaacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaagccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggatgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacatcctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacacaaataaa ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactagac caggaagaagttgaatctctgaatagaccagtaacaggagctgaaattgtggcaataatc aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaagcaataaatgca atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcactgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag aaggaaataaaaggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatagtgaaaatggccatactgcccaaggtaatt tacagattcaatgccatccccatcaaactaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca gtaaccaaaacagcatggtactag >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_5|111_aa MVRTKADSVPGTYRKVVAARAPRKVLGSSTSATNSTSVSSRKAENKYAGGNPVCVRPTPK WQKGIGEFFRLSPKDSEKENQIPEEAGSSGLGKAKRKACPLQPDHTNDEKE >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_5|336_bp atggtgcggactaaagcagacagtgttccaggcacttacagaaaagtggtggctgctcga gcccccagaaaggtgcttggttcttccacctctgccactaattcgacatcagtttcatcg aggaaagctgaaaataaatatgcaggagggaaccccgtttgcgtgcgcccaactcccaag tggcaaaaaggaattggagaattctttaggttgtcccctaaagattctgaaaaagagaat cagattcctgaagaggcaggaagcagtggcttaggaaaagcaaagagaaaagcatgtcct ttgcaacctgatcacacaaatgatgaaaaagaatag >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_6|581_aa MAVAGAVSGEPLVHWCTQQLRKTFGLDVSEEIIQYVLSIESAEEIREYVTDLLQGNEGKK GQFIEELITKWQKNDQELISDPLQQCFKKDEILDGQKSGDHLKRGRKKGRNRQEVPAFTE PDTTAEVKTPFDLAKAQENSNSVKKKTKFVNLYTREGQDRLAVLLPGRHPCDCLGQKHKL INNCLICGRIVCEQEGSGPCLFCGTLVCTHEEQDILQRDSNKSQKLLKKLMSGVENSGKV DISTKDLLPHQELRIKSGLEKAIKHKDKLLEFDRTSIRRTQVIDDESDYFASDSNQWLSK LERETLQKREEELRELRHASRLSKKVTIDFAGRKILEEENSLAEYHSRLDETIQAIANGT LNQPLTKLDRSSEEPLGVLVNPNMYQSPPQWVDHTGAASQKKAFRSSGFGLEFNSFQHQL RIQDQEFQEGFDGGWCLSVHQPWASLLVRGIKRVEGRSWYTPHRGRLWIAATAKKPSPQE VSELQATYRLLRGKDVEFPNDYPSGCLLGCVDLIDCLSQKQFKEQFPDISQESDSPFVFI CKNPQEMVVKFPIKGNPKIWKLDSKIHQGAKKGLMKQNKAV >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_6|1746_bp atggcggtggctggggcggtgtccggggagccgctggtgcactggtgcacccagcagttg cggaagactttcggcctggatgtcagcgaggagatcattcagtacgttttgtcaattgag agtgctgaagagatacgagaatatgttactgatctcctccagggaaatgaaggcaaaaaa ggtcaattcatagaagaacttataaccaaatggcaaaagaatgatcaggagttgatttcg gatcctttgcagcagtgcttcaaaaaagatgaaattttagatgggcagaaatcaggcgac catctaaagcggggtaggaagaaagggagaaacagacaggaagttcctgcatttactgaa cctgacacgactgcagaggttaaaacaccttttgatttggccaaggcacaagagaacagc aactccgtaaagaagaagacaaagtttgtcaatttatacacaagagagggacaggacagg cttgcagtcctgctccctggtcgtcacccttgtgattgcctgggccagaagcacaagctc atcaataactgtctgatctgtgggcgcattgtctgtgaacaagaaggctcaggcccttgc ttattctgtggcactctggtgtgtactcatgaggaacaagatattttacagcgtgactca aacaagagccagaaactgctaaagaaactcatgtcaggagtggagaattctggaaaggtg gacatctctaccaaggaccttcttcctcatcaagaattgcgaattaagtctggtctggag aaggctatcaagcataaagacaaactgttagagtttgacagaactagtattcgaaggacc caagtcattgatgatgagtcagattactttgccagtgattctaaccaatggttgtccaaa cttgagcgggaaaccttgcagaagcgagaggaggagctgagagaacttcgacacgcctct cgactttctaagaaggtcaccattgactttgcaggaaggaagatcctggaagaagaaaat tcactagcagagtatcatagcagactagatgagacaatacaggccattgccaatggaacc ttgaaccagccactgaccaaattggatagatcttctgaagagcctttgggagttctggta aatcccaacatgtaccagtcccctccccagtgggttgaccacacaggtgcagcctcacag aagaaggctttccgttcttcaggatttggactagagttcaactcatttcagcaccagttg cgaatccaggatcaagaatttcaggaaggctttgatggtggctggtgcctctctgtacat cagccctgggcttctctgcttgtcagagggattaaaagggtggagggcagatcctggtac accccccacagaggacgactttggatagcagccacagctaaaaaaccctcccctcaagaa gtctcagaactccaggctacatatcgtcttcttcgtgggaaagatgtggaatttcctaat gactatccgtcaggttgtcttctgggctgtgtggacctaattgactgcttgtcccagaag caatttaaggagcagtttccagacatcagtcaagagtctgattctccatttgttttcatc tgcaaaaatcctcaggaaatggttgtgaagtttcctattaaaggaaatccaaaaatctgg aaattggattccaagatccatcaaggagcaaagaaggggttaatgaagcagaataaagct gtctga >gi568815583r:64266033_64481371|GENSCAN_predicted_peptide_7|119_aa MKGEKGKGNKKNNEVEELEKGGYFRDTPLFVILRQMSSTCIFSIKNNPPPPPPPPRSGTT RVTRKTRTLDCQALGSGFRACVESGAESLPTPAPDKHYPTELFAAMEMFYICAVQCRSR >gi568815583r:64266033_64481371|GENSCAN_predicted_CDS_7|360_bp atgaaaggagaaaagggaaagggtaacaagaaaaataatgaagtagaagagttagaaaag ggagggtatttcagagatacccccttgttcgtcatacttcggcaaatgtcatcaacgtgc attttcagcattaaaaataacccgccgccaccgccgccgccgccgcgatccgggacaacc agagtcacccggaagacgcggacactggactgccaggccctcggatccggattccgcgct tgtgtggagtctggagccgaaagtttgccaacccctgctccagacaagcactatccaaca gaactttttgcagcgatggaaatgttctatatctgtgctgtccaatgcagaagtcgctag