GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:27:29 Sequence gi568815580r:5190861_5392205 : 201345 bp : 38.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5452 5638 187 1 1 54 20 260 0.787 14.57 1.02 Intr + 6466 6550 85 0 1 76 92 89 0.653 6.57 1.03 Intr + 6802 6921 120 2 0 39 34 133 0.034 2.55 1.04 Intr + 15597 15759 163 1 1 86 81 32 0.025 0.41 1.05 Intr + 18876 19111 236 0 2 -13 95 145 0.033 1.51 1.06 Intr + 21342 21447 106 1 1 73 102 14 0.695 -0.35 1.07 Intr + 23375 23485 111 2 0 87 106 34 0.670 3.68 1.08 Term + 28505 28640 136 2 1 88 41 70 0.399 -1.19 1.09 PlyA + 29023 29028 6 1.05 2.07 PlyA - 29336 29331 6 1.05 2.06 Term - 41297 41041 257 0 2 14 38 186 0.516 0.96 2.05 Intr - 42342 42259 84 1 0 82 100 91 0.825 8.57 2.04 Intr - 46571 46454 118 2 1 64 74 126 0.942 7.92 2.03 Intr - 47706 47559 148 2 1 -21 93 220 0.068 10.92 2.02 Intr - 60331 60238 94 2 1 133 56 69 0.117 6.30 2.01 Init - 60544 60514 31 1 1 86 46 39 0.503 -0.55 2.00 Prom - 61686 61647 40 -4.15 3.00 Prom + 66304 66343 40 -2.35 3.01 Init + 68848 68858 11 0 2 88 110 5 0.619 2.85 3.02 Intr + 69651 69865 215 0 2 34 64 145 0.671 4.14 3.03 Term + 71408 71634 227 0 2 84 55 180 0.983 10.26 3.04 PlyA + 73113 73118 6 1.05 4.04 PlyA - 79474 79469 6 1.05 4.03 Term - 80380 79944 437 2 2 17 32 279 0.522 9.66 4.02 Intr - 80641 80509 133 1 1 52 55 70 0.045 -0.60 4.01 Init - 97009 96917 93 1 0 45 70 98 0.557 4.13 4.00 Prom - 98186 98147 40 -3.95 5.10 PlyA - 98977 98972 6 1.05 5.09 Term - 101344 99998 1347 1 0 96 44 1284 0.941 115.28 5.08 Intr - 103141 103112 30 1 0 71 111 23 0.536 0.31 5.07 Intr - 104020 103941 80 2 2 20 92 77 0.904 -0.35 5.06 Intr - 104666 104476 191 1 2 40 61 113 0.910 2.11 5.05 Intr - 105000 104792 209 0 2 39 54 149 0.647 3.55 5.04 Intr - 106251 106213 39 0 0 14 88 104 0.028 0.50 5.03 Intr - 107994 107806 189 0 0 78 28 87 0.015 0.56 5.02 Intr - 132506 132434 73 1 1 66 102 29 0.001 0.59 5.01 Init - 143914 142272 1643 1 2 94 53 553 0.657 44.01 5.00 Prom - 147802 147763 40 -6.95 6.00 Prom + 158423 158462 40 -3.55 6.01 Init + 159254 159329 76 1 1 85 89 71 0.857 8.20 6.02 Intr + 174025 174155 131 2 2 21 70 154 0.045 6.39 6.03 Intr + 179275 179394 120 0 0 9 101 76 0.025 0.77 6.04 Term + 185934 186188 255 2 0 77 43 135 0.689 2.50 6.05 PlyA + 188499 188504 6 1.05 7.02 PlyA - 189503 189498 6 1.05 7.01 Sngl - 192887 192663 225 2 0 101 38 117 0.498 2.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 47865 47757 109 0 1 38 49 132 0.852 3.24 S.002 Term + 134305 134814 510 0 0 11 41 228 0.867 3.79 S.003 Term + 174025 174206 182 2 2 21 47 219 0.804 7.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_1|381_aa XGADSNAHARTRRQAGQPQTNLTLRQLVATESSKGTKVLHRLLQKDTATSSGAAVAASVL QPPLGDVHCCLGSKALPPLSRDEGRKSKREKGREQEEGHSDYGALEEGATIRFRSQVANP ALKAPKLEGPQGKAQASFWKLSVLHAGLSVPPLSCKNPYVHVCVSSALDYNELLPSLLNY EFLKESWRPRRAGGIIQSNPEDLRTRGANGEVLVQVPRPENQKCQCPKARVDECPSSNTE HQCSLPLPFCFIGHSVDRMMPIHTGPSHNVWEFWEIQFKLRFGRDTAKPYHMTSQVENST SWLPTESLCSALLPYVKITRDSITKHVIFCAACDSGGKMSVHTVMKEELLLPWIPESRAH IPVLLTGLHVCKKQTSTVLSH >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_1|1146_bp nntggggcggactcaaatgcgcacgcacgcacacgccgccaagccggtcaaccgcagaca aatctgactcttcgccaactagtagccacggagagcagcaaagggaccaaggtccttcac cgcctgctccaaaaagacacggcaacctccagcggcgccgcagtagctgcctctgtgctg cagcctccgctcggcgacgtacactgctgcttagggagcaaagcacttccaccgctttca cgcgacgaaggtcggaagagcaagagagagaaaggaagagagcaagaggaaggacacagt gactatggagcgctcgaggagggcgctactatcaggttccgctcccaggtggcaaatccg gctctgaaagctcctaagcttgaaggtcctcaaggcaaggcccaggcctccttctggaag ctttctgtgctccatgcagggttaagtgtccctcccctgtcttgcaagaatccctatgtt catgtctgtgtgagtagtgcactggactataatgaactgctgccttccttactaaactat gagttcctcaaggaaagctggaggcccaggagagctggtggtataattcagtccaaccct gaagacctgagaaccaggggagccaatggtgaagtcttggtccaagttccaaggcctgag aaccagaaatgccaatgtccaaaagcaagagtggatgaatgtcccagctcaaatacagaa caccagtgttcccttcctctgcctttttgttttattgggcactcagtggaccggatgatg cccatccacactggtccctcccacaacgtgtgggaattctgggagatacaattcaagttg agatttggtagggacacagccaaaccatatcacatgacatcacaagtagaaaattctacg agctggctgcctacagaaagcctgtgttctgctctgcttccctatgtgaaaattaccaga gacagcataacaaagcatgtgattttctgtgctgcctgtgactctggtggcaagatgtca gtgcacacagtgatgaaagaagagcttctgttaccctggatccctgaaagcagagcccat atcccggtcttattgactggacttcacgtttgtaagaaacaaacatctaccgtgttgagc cactaa >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_2|243_aa MTSVNNGLEGRKTMKGDEEKWKCTPNIPDWREKKHFPHPVYLRSPDTGDPARLFPTHDGE DPLQRVHEAPVLSQDGALPLDVGDYELLRQPANQQLIQTLGPQSRWNNGRRLPECQVLQD ELKLRVVGRLAPSSWKLLHHESQAEKEVNAQAETQKKKEKEFRRSIIKLIKEAPEGEVQL KIIKNMIQDTKGKFISETDSINKKQSQVLEIKDTLREMQNAMESLSNQTEQAEENFRARR QGF >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_2|732_bp atgaccagtgtgaacaatggactagaagggagaaagaccatgaaaggcgatgaagaaaaa tggaaatgtacccccaatattcccgactggagagagaagaaacacttcccacatcctgtt tatctccgctccccagacaccggggatccggcccgactgtttcctacccacgacggcgaa gacccgctacagcgcgtccacgaagccccagtactgagccaggacggggctctccctctc gatgtaggcgactacgaactgctccgccagccggcaaatcagcagcttatacagacattg gggccacaaagccggtggaacaacgggagacggcttcctgagtgtcaggtcctccaagat gagcttaaacttcgggtggtgggcaggctcgcgccatcttcttggaaacttctgcaccat gagagccaagcagagaaagaagtgaatgcgcaggctgaaacgcaaaagaagaaagaaaaa gaattcagaaggtcgattattaagctaattaaggaggcaccagagggtgaagtccaactt aagataatcaaaaacatgatacaggatacgaaaggaaaattcatcagtgaaacagatagc ataaataaaaaacaatcacaagttctggaaatcaaggacacacttagagaaatgcaaaat gcaatggaaagtcttagcaatcaaactgaacaagcagaagaaaacttcagagctcgaaga caaggcttttga >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_3|150_aa MATGQRIQNSCFEETQRNSDNIEKEFRIQSYKLNREIEITEKNQAEILKLKSVTGIPKIA SESLNSRINQAEETIRHREDTQTCTGQPLTPKPTLPPVQLCTLSPTGPPTSCIASTNMVN AHSEDGTPASASTLLKLPHLGPPSTVDSKP >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_3|453_bp atggccacaggacagagaattcaaaatagctgttttgaggaaactcaaagaaattcagat aatatagagaaggaattcagaattcaatcatacaaactaaacagagagattgaaattact gaaaagaatcaagcagaaattctgaagctgaaaagtgtaactggcataccgaagattgca tcagagtctttaaatagcagaattaatcaagcagaagaaacaattaggcacagagaagac acccagacctgcactggccagcccttgacccccaagccaacactacctccagtacaacta tgcacactgtctccaacaggaccccccaccagctgcattgcctccaccaatatggtgaat gcccatagtgaggatggaacccctgcatctgccagcactctgttgaagctgccacacctt ggccctcccagcacagtggactccaaaccttga >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_4|220_aa MLPTCVEHHMRGTGLDPGDAKAEAVWSLAVRSRDLVPSVPAVPAMTKRGQDTAWAVASEG GSPKPWQLPCGVEATVWKGNVESDPPHRVPTGAPPSRAMRRGPVSSRPQNGRSTDSLHRV PGKAADTQCQPMKAAKRGAIPCKATGMEQPKTMGTHLLHQCDLDVRHGVKGDHFGALRFD CPAGFQTRMGSAALLFWPISPIWNSYIYPVSVPPLYLRSN >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_4|663_bp atgttgcctacttgtgttgagcatcatatgcgaggcactgggctggaccctggagatgca aaggcagaagcagtttggtccctggctgttagatctagggacttggtgccctctgtccca gctgttccagccatgactaaaagaggccaagatacagcttgggctgttgcttcagagggt ggaagccccaagccttggcagcttccatgtggtgttgaagctacagtgtggaagggaaat gtggagtctgatcccccacacagagtccccactggggcaccacctagtagagctatgaga agagggccagtgtcctccagaccccagaatggtagatccactgacagcttgcaccgtgtg cctggaaaagctgcagacactcaatgccagcccatgaaagcagccaagaggggagctata ccctgcaaagccacagggatggagcagcccaaaaccatgggaacccacctcttgcatcag tgtgacctggatgtgagacatggagtcaaaggggatcattttggagctttaagatttgac tgccctgctggatttcagactcgcatggggtctgcagcccttttgttttggccaatttct cccatttggaacagctatatttacccagtgtctgtgcccccattgtatctaagaagtaac taa >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_5|1266_aa MDFSCFAFLYLRHRKKFNQKREFCFPRRLYANSFQTLLEKWSRGTKAPRSSHGTCFHGNI ILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFA DDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELPFTI ASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKV IYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKA TVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCRENWL AICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKA MATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTREKIFATYSSDKGLISRIYNELKQIY KKKTNNPIKKWAKDMNRHFSKEDIYAAKKHTKKCSSSLAIREMQIKTTMRYHLTPVRMAI IKKSGNNRCSASFHPLLKYPYRKGVKRELKSKAWLCSGFGLASFSGLFVIQGPKLKRGLW PNHGRVVLDVTAEEEAQNKGCSQNLYSDPAHSHLLILNYAFRGNFACERARGTHGSVCAC ARLQVSAGQGAEGAGAVRKAGAGAAGGWRGGPEDARSRRALGHQPAPGCAAAASGAARAS AAEFMAAPARTAAVHRGGGGDGPNPARSAARRAPPLRAAGPAAPPTAAGGGGGRDPAHAL PVASLQQGSRAAAIVGLPGAGEAAAAAYDMGLQDGVLPEFFISMSETIKYNDDDHKTLFL KTLNEQRLEGEFCDIAIVVEDVKFRAHRCVLAACSTYFKKLFKKLEVDSSSVIEIDFLRS DIFEEVLNYMYTAKISVKKEDVNLMMSSGQILGIRFLDKLCSQKRDVSSPDENNGQSKSK YCLKINRPIGDAADTQDDDVEEIGDQDDSPSDDTVEGTPPSQEDGKSPTTTLRVQEAILK ELGSEEVRKVNCYGQEVESMETPESKDLGSQTPQALTFNDGMSEVKDEQTPGWTTAASDM KFEYLLYGHHREQIACQACGKTFSDEGRLRKHEKLHTADRPFVCEMCTKGFTTQAHLKEH LKIHTGYKPYSCEVCGKSFIRAPDLKKHERVHSNERPFACHMCDKAFKHKSHLKDHERRH RGEKPFVCGSCTKAFAKASDLKRHENNMHSERKQVTPSAIQSETEQLQAAAMAAEAEQQL ETIACS >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_5|3801_bp atggacttttcctgctttgcttttctctacttgagacacagaaagaaattcaatcagaag cgtgaattctgctttcccaggaggctctatgctaacagttttcaaactttacttgagaaa tggtcccgaggaacaaaagcacctagaagcagtcatggcacgtgtttccatggcaatatc atactgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgc cctctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcag gagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca gacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctg ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc ttatacaccaacaacagacaaacagagagccaaatcatgggtgaactcccattcacaatt gcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttc aaggagaactacaaaccactgctcaaggaaataaaagaggacacgaacaaatggaagaac attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggta atttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaa aaaactactttaaagttcatatggaaccaaaaaagagcccgcattgccaagtcaatccta agccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggct acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacaga acagagccctcagaaataatgccgcatatctacaactatctgatctttgacaaacctgag aaaaacaagcaatggggaaaggattccctatttaataaatggtgccgggaaaactggcta gccatatgcagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattca agatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaaccta ggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagca atggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcaca gcaaaagaaactaccatcagagtgaacaggcaacctacaacacgggagaaaattttcgca acctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttac aagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctca aaagaagacatttatgcagccaaaaaacacacgaagaaatgctcatcatcactggccatc agagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatc attaaaaagtcaggaaacaacaggtgctctgctagctttcatcctttgctgaagtatccc tacagaaaaggagttaagagggagttgaagtcaaaggcttggctgtgctcaggttttggg ttggcttcattctcaggtctgtttgtcattcaagggcccaagctgaagagggggttgtgg cctaaccatggtcgtgttgtgctggacgtcacagcagaggaggaggcgcagaacaaaggc tgctctcaaaacctctactcagacccggcccactctcatctgctgatattaaattacgcg tttcggggaaacttcgcgtgtgagcgagcgcggggaacccacggcagtgtctgtgcctgt gccaggctgcaggtaagcgcgggccagggtgcggaaggggctggggccgtccgcaaggcg ggcgcgggggcggccggcggctggcggggcggcccggaagacgcgcgttctcgccgagct ctggggcaccagccagctccaggatgtgcggcggccgccagcggcgcggcgcgagcctcg gcggccgagttcatggccgcccccgctcgcaccgccgcagtgcaccgcggtgggggtggg gacggcccgaacccagcgcggagcgcggcccgccgggcgccgcccctccgagccgcgggc cccgccgcgccgcctacagccgccggcggaggagggggccgggacccagcgcacgcactc ccggttgcaagcttgcaacaaggaagtcgtgcagccgccattgtcgggctgcccggggca ggggaggcggctgcagctgcttatgatatgggtctgcaggatggagttcttccggagttt ttcatcagtatgtctgaaaccattaaatataatgacgatgatcataaaactctgtttctg aaaacactaaatgaacaacgcctggaaggagaattttgtgatattgctattgtggttgag gatgtgaaattcagagcacacagatgtgttcttgctgcctgcagcacttactttaaaaag cttttcaagaagcttgaggttgatagttcttcggtcatagaaatagattttcttcgttct gatatatttgaagaggtcctgaactacatgtacacagcaaagatttccgtgaaaaaagaa gatgttaacttaatgatgtcatcgggtcagattcttggtatccgatttttggataaactg tgttctcagaagcgtgatgtgtccagtcccgatgaaaacaatggtcagtccaaaagtaag tattgccttaaaataaatcgccccattggagatgctgctgacacccaggatgatgatgta gaggaaatcggggatcaggatgacagtccttctgatgacacagtagaaggcacacccccg agtcaggaggacggcaagtcgcccaccacaacgctcagggttcaggaagcgatcctgaaa gagctggggagtgaggaagttcggaaggtcaattgctacggccaggaagtagaatccatg gagaccccagaatcaaaagacttggggtcccagacccctcaagccttaacatttaatgat gggatgagtgaagtgaaagatgaacagacaccaggctggacaacagccgccagtgacatg aagtttgagtatttgctttatggtcaccatcgggagcagattgcctgccaggcgtgtggg aagacgttttctgatgaaggcagattgaggaagcatgagaaactccacacggcggacagg ccatttgtttgtgaaatgtgcacaaaaggtttcaccacacaggcccacctgaaagaacac ctaaaaatccacacaggatataagccctatagctgtgaggtgtgtggaaaatcatttatc cgtgccccagacttaaagaagcatgagagagttcacagtaatgaaagaccgtttgcgtgc cacatgtgtgacaaagccttcaaacacaagtctcacctcaaggatcatgaaagaagacac agaggggaaaagccttttgtgtgtggctcctgcaccaaggcatttgccaaggcatctgat ctgaaaaggcacgagaacaatatgcacagtgaaaggaagcaggttacccccagtgccatc cagagcgagacagaacagttgcaggcggcagcgatggctgcggaagcagaacagcagctg gagacgatagcctgtagctag >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_6|193_aa MDEAGSHYPKRTNTGTENQIPHVLTLYSVLYLRGGKDEKDTISGAKELVISGDKALWDTE KEDLSWCLKCSHMHSLYVNSNALTIVLAFAQKLTRGKGIWKLDIHRAENGLSELSPTSGA AFIQQSRLIEKVTTNHKQGEFESQLYETADLGWNVGSTFSFSWEAASSVSFWSPNALYHL AAQHSWTTPLKST >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_6|582_bp atggatgaagctggaagccattatcctaagcgaactaacacaggaacggaaaaccagata ccacatgttctcactttgtatagtgtactgtatctccgaggaggcaaagatgagaaagat accatctctggtgccaaggagcttgtcatctcaggagacaaggcactgtgggacacggag aaggaggacttgtcttggtgtctgaagtgtagccacatgcacagcctctatgttaatagc aatgctctgacaatagtgctggcctttgctcagaagctcaccagaggaaagggtatttgg aagttagacattcacagggcagagaatggattatcagagttgagccccacatctggagca gcattcatccagcagtcgaggctaattgaaaaagtcaccacaaatcataagcagggggaa tttgaaagtcagctttatgaaacagcagacttgggttggaatgtaggcagcaccttctca tttagttgggaggcagctagctcagtgtccttctggtccccaaatgccttgtatcatttg gctgctcaacattcgtggacaactcctctgaaatcaacctag >gi568815580r:5190861_5392205|GENSCAN_predicted_peptide_7|74_aa MALKKQTKTIDSEGSNPVLCHLLALIPQANYLDTLSLSLPAVIKGSTASYTGGCSCDETL QPYEKVYEQVGHSS >gi568815580r:5190861_5392205|GENSCAN_predicted_CDS_7|225_bp atggccttaaaaaagcaaacaaaaaccatagactctgaaggcagcaatcctgtgctttgc cacttactagctctgatacctcaggccaattatctagacactctgagcctcagtttaccc gcagttataaagggaagcacagccagctacacaggtggttgtagttgtgatgaaaccctt cagccatatgagaaagtatatgagcaagtaggacattcttcgtaa