GENSCAN 1.0 Date run: 3-Nov-116 Time: 14:30:57 Sequence gi568815591f:107374061_107575359 : 201299 bp : 37.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4248 4471 224 2 2 79 72 187 0.649 14.28 1.02 Intr + 5197 5309 113 1 2 -16 98 98 0.036 -0.50 1.03 Intr + 9479 9607 129 0 0 57 62 81 0.616 2.15 1.04 Intr + 12352 12463 112 2 1 40 89 182 0.805 12.02 1.05 Intr + 14060 14272 213 2 0 22 96 101 0.320 1.21 1.06 Term + 14475 14940 466 0 1 13 28 273 0.408 7.30 1.07 PlyA + 16833 16838 6 1.05 2.06 PlyA - 19033 19028 6 1.05 2.05 Term - 29586 29477 110 1 2 73 50 71 0.449 -0.51 2.04 Intr - 29815 29699 117 2 0 72 82 73 0.455 4.62 2.03 Intr - 34318 34234 85 1 1 68 -7 55 0.182 -7.53 2.02 Intr - 38572 38442 131 2 2 63 90 111 0.740 8.29 2.01 Init - 53614 53461 154 1 1 48 84 142 0.920 9.10 2.00 Prom - 89062 89023 40 -2.95 3.00 Prom + 97559 97598 40 -6.85 3.01 Sngl + 100031 101302 1272 1 0 87 44 228 0.962 13.95 3.02 PlyA + 101577 101582 6 1.05 4.00 Prom + 119019 119058 40 -1.35 4.01 Init + 121075 121240 166 0 1 65 92 67 0.601 4.69 4.02 Intr + 131744 131940 197 0 2 98 37 156 0.672 9.71 4.03 Intr + 134946 135151 206 2 2 64 55 195 0.318 10.88 4.04 Intr + 135345 135592 248 0 2 -48 58 184 0.522 -2.02 4.05 Term + 136641 137686 1046 1 2 23 48 638 0.621 44.60 4.06 PlyA + 137692 137697 6 1.05 5.00 Prom + 147116 147155 40 -3.15 5.01 Init + 159081 159368 288 2 0 60 72 238 0.852 16.56 5.02 Intr + 159536 159860 325 1 1 -21 77 199 0.168 2.32 5.03 Term + 169606 170108 503 2 2 41 40 260 0.084 10.16 5.04 PlyA + 170840 170845 6 1.05 6.07 PlyA - 171150 171145 6 1.05 6.06 Term - 173728 173720 9 1 0 100 36 0 0.207 -6.78 6.05 Intr - 174120 174051 70 2 1 124 68 53 0.338 5.17 6.04 Intr - 174272 174218 55 0 1 86 52 67 0.469 0.02 6.03 Intr - 180282 180225 58 0 1 73 71 60 0.564 0.34 6.02 Intr - 184055 183916 140 0 2 92 91 121 0.969 12.06 6.01 Init - 189929 189743 187 2 1 101 103 281 0.986 28.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 4850 5270 421 2 1 13 49 203 0.863 2.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:107374061_107575359|GENSCAN_predicted_peptide_1|418_aa MRKNQRKKVENYKNQNASSLPENHNSSPATKQNWMENEFDEMTEAGFRRWVITNSSELKE HVLTQCKEAKNLEKRALEGSTKYGKEKPVPATAKTYQIVKTTDTMEKLHQLTGPPTSRVQ AAFAVPAKVAGTWGCKDRRGKEDTLPSLPHVPQVSGSARQTPAAGAPCEERCGGSTTDRR KEVKSTAQSVSNQNWLNKLSYNAQIASSQAISTRVNPRELEPNQLWQTDVPHNPEFGKLR YVRISIDTNSHLISAHALPRESTRKQKRESMGKDPATLLAQALFTLKFLNLDDKFQSAVE KHFSKTSQGIKPAVSWKDVNSNVRCGPSELLMRGRGHACVHTPLGPLCVPAWRIKPHHGV PRTQPGTRDEGTDPTGPAALNNAASTDDTSLKCHLMLKRITQKAETNSTPDTDTIYSK >gi568815591f:107374061_107575359|GENSCAN_predicted_CDS_1|1257_bp atgaggaaaaaccagcgcaaaaaggttgaaaattacaaaaaccagaatgcctcttctctt ccagaaaatcacaactcctcgccagcaacgaaacaaaactggatggagaatgagtttgat gaaatgacagaagcaggcttcagaaggtgggtaataacaaactcctccgagctaaaggag catgttctaacccaatgcaaggaagctaagaaccttgaaaaaagagctcttgaaggaagc actaaatatggaaaggaaaaaccagtaccagctactgcaaaaacataccaaattgtgaag accacggacactatggagaaactgcatcaattaacaggcccccctacctcaagggtgcaa gctgcttttgcggtgcctgccaaggttgctggaacttgggggtgcaaggacagaagaggg aaagaggacactcttccctctctccctcacgtaccccaggtatctgggtctgccagacag acccccgcagctggtgccccgtgtgaggaacgctgcggaggaagcacgacggaccgccga aaagaagtgaaaagcaccgcgcagtcagtgagtaatcaaaactggctaaacaaattatcc tacaatgcccagattgccagctcacaggcaatctccacacgtgttaaccctagagaatta gaacctaatcagttatggcaaacagacgttccacacaaccctgaatttggaaaacttaga tatgtacgtatatccattgataccaattctcacttaattagtgcccatgctttgcctaga gaatcaacccgaaaacagaaaagggagagtatgggtaaagaccctgcaacactattggca caagccttatttacccttaagtttttaaatttagatgacaaatttcaatcagctgtagaa aagcacttttctaaaacctctcaaggtataaaacctgcagtttcatggaaagatgtgaac agtaatgtacggtgtggtccaagtgaattactaatgcggggaagaggacatgcttgtgtt cacacccccttaggtcctctttgtgttccagcatggcgcatcaaaccacaccatggcgtg cctaggacccaacctggtaccagagatgaaggaactgaccctacaggacccgcagccctg aacaatgcggcttccacagacgacacaagcctcaaatgtcacctgatgctgaagaggata actcagaaggctgaaacgaattctactccagacacagacaccatttactccaaataa >gi568815591f:107374061_107575359|GENSCAN_predicted_peptide_2|198_aa MSPVAGLGLSEASVEASPAGGGPAGKSHREKSSVNCITPGKTLLPSKVTFTDYLSQGIDL SGIEVIENDLLFIARARLEVENQAKRLLEQGLETQLLWSTTIQLENLEISSSKDQHTFCE ETDNKLSLHFYGCYVPNASSLNSPYVEILTAKIMLFGGRPLGEPERAAYPFAMWDHIKKV SVVREGNITHRGLLGLGG >gi568815591f:107374061_107575359|GENSCAN_predicted_CDS_2|597_bp atgtctccagtggcaggcctggggctgagcgaggcttcggtggaggcgtcaccagctgga ggaggtccagctggcaaaagtcaccgagaaaaatccagtgtcaactgtattacacctggg aaaaccctgcttccaagtaaagtcacatttacagattatctttctcaaggaatagatctt tctggaatagaagtgatagaaaatgatctactttttattgcaagagcccgacttgaagtg gaaaatcaagctaagcgcctactagagcagggtttggagactcagctgctgtggagcacc actattcaactggaaaacttggaaatcagctcatctaaggaccagcacactttttgcgaa gagacagataataaattgtcacttcatttttatggctgctatgttcccaatgcatcctcc ttaaattctccttatgttgaaatcctaaccgccaaaattatgttatttggaggtcggcct ttgggagagcctgagagagctgcttatccttttgccatgtgggatcatattaaaaaggtg tccgttgtcagggaagggaacatcacacaccggggcctgttggggttagggggctag >gi568815591f:107374061_107575359|GENSCAN_predicted_peptide_3|423_aa MQSESNITVRDDIDDINTNMYQPLSYPLSFQVSLTGFLMLEIVLGLGSNLTVLVLYCMKS NLINSVSNIITMNLHVLDVIICVGCIPLTIVILLLSLESNTALICCFHEACVSFASVSTA INVFAITLDRYDISVKPANRILTMGRAVMLMISIWIFSFFSFLIPFIEVNFFSLQSGNTW ENKTLLCVSTNEYYTELGMYYHLLVQIPIFFFTVVVMLITYTKILQALNIRIGTRFSTGQ KKKARKKKTISLTTQHEATDMSQSSGGRNVVFGVRTSVSVIIALRRAVKRHRERRERQKR VFRMSLLIISTFLLCWTPISVLNTTILCLGPSDLLVKLRLCFLVMAYGTTIFHPLLYAFT RQKFQKVLKSKMKKRVVSIVEADPLPNNAVIHNSWIDPKRNKKITFEDSEIREKCLVPQV VTD >gi568815591f:107374061_107575359|GENSCAN_predicted_CDS_3|1272_bp atgcagtctgaatctaacattacagtgcgagatgacattgatgacatcaacaccaatatg taccaaccactatcatatccgttaagctttcaagtgtctctcaccggatttcttatgtta gaaattgtgttgggacttggcagcaacctcactgtattggtactttactgcatgaaatcc aacttaatcaactctgtcagtaacattattacaatgaatcttcatgtacttgatgtaata atttgtgtgggatgtattcctctaactatagttatccttctgctttcactggagagtaac actgctctcatttgctgtttccatgaggcttgtgtatcttttgcaagtgtctcaacagca atcaacgtttttgctatcactttggacagatatgacatctctgtaaaacctgcaaaccga attctgacaatgggcagagctgtaatgttaatgatatccatttggattttttcttttttc tctttcctgattccttttattgaggtaaattttttcagtcttcaaagtggaaatacctgg gaaaacaagacacttttatgtgtcagtacaaatgaatactacactgaactgggaatgtat tatcacctgttagtacagatcccaatattctttttcactgttgtagtaatgttaatcaca tacaccaaaatacttcaggctcttaatattcgaataggcacaagattttcaacagggcag aagaagaaagcaagaaagaaaaagacaatttctctaaccacacaacatgaggctacagac atgtcacaaagcagtggtgggagaaatgtagtctttggtgtaagaacttcagtttctgta ataattgccctccggcgagctgtgaaacgacaccgtgaacgacgagaaagacaaaagaga gtcttcaggatgtctttattgattatttctacatttcttctctgctggacaccaatttct gttttaaataccaccattttatgtttaggcccaagtgaccttttagtaaaattaagattg tgttttttagtcatggcttatggaacaactatatttcaccctctattatatgcattcact agacaaaaatttcaaaaggtcttgaaaagtaaaatgaaaaagcgagttgtttctatagta gaagctgatcccctgcctaataatgctgtaatacacaactcttggatagatcctaaaaga aacaaaaaaattacctttgaagatagtgaaataagagaaaaatgtttagtgcctcaggtt gtcacagactag >gi568815591f:107374061_107575359|GENSCAN_predicted_peptide_4|620_aa MAESWGLTNLRCQRTELMAGTATERIGKKHTKEKEVRAIILVPTLPESLAIANYTETGVP KGKNTVNAVVPLGLDAQCGCHTLGCCLECLQGIQRYGLSSSLPAVGTSTSSDGGGRKVTQ TERSSSPAMEQSWKENDFDVLREEGFRRSNYSELKEKVRTNGKEVKNLEKKVDEWLTRIT NAEKSLKELRPSLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQ RCSSRRATPTHVIVRFTKVEMKDKMLRAAREKDHSAIKLELRIKKLTQNRSTTWKLNNLL QNDYWVHNEMKAEIKMFSETNGNKDTTYQYLWHTFKAVCRGKFIALNAHKRKQERSKIDT LTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQNTVQKINESRSWFFEKINKTDRL LARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTITEYYKHLYANKLDNLEEMDKFLDTY TIPRLKQEEVESLNRPITDSEIVAIINSLPTKKSPGADGFTAKFYQTYKEELVPFLLKLF QSIEKEGILPNSLYEASIILIPKPGRDTTKKESFRPISLMNADAKILNTILANRIQQHIK KLIYHDQVGFIPGMQGWFNI >gi568815591f:107374061_107575359|GENSCAN_predicted_CDS_4|1863_bp atggcagaaagctggggccttactaacttgcggtgtcagaggactgagctcatggctggc acggcaactgaaaggatagggaagaaacacacaaaggaaaaagaggtaagggccataatt ttagtaccaactttgccagaatccttggcgatagctaactacacagaaacaggtgtccct aagggcaagaatactgtgaatgctgttgttcctctgggtctagatgcccagtgtggctgc catactctaggctgctgcttggaatgcctgcaaggaatccagagatatggcctgtcctca agtctcccagcagtgggtaccagtaccagctctgatgggggtggcaggaaagtgacacag actgaacgcagctcctcaccagcaatggaacaaagctggaaggagaatgactttgacgtg ttgagagaagaaggcttcagaagatcaaactactctgagctaaaggagaaagttcgaacc aatggcaaagaagttaaaaaccttgaaaaaaaagtagacgaatggctaactagaataacc aatgcagagaagtccttaaaggagctgagaccaagtctacgtctgattggtgtacctgaa agtgatggggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaac ttccccaatctagcaaggcaggccaacattcagattcaggaaatacagagaacgccacaa agatgctcctcgagaagagcaactccaacacacgtaattgtcagattcaccaaagttgaa atgaaggacaaaatgttaagggcagccagagagaaagaccacagtgcaatcaaactagaa ctcaggattaagaaactcactcaaaaccgctccactacatggaaactgaacaacctgctc cagaatgactactgggtacataacgaaatgaaggcagaaataaagatgttctctgaaacc aacggaaacaaagacacaacataccagtatctctggcacacattcaaagcagtgtgtaga gggaaatttatagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacacc ctaacgtcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcaga aggcaagaaataactaaaatcagagcagaactgaaggaaatagagacacaaaataccgtt caaaaaattaatgaatccaggagctggttttttgaaaagatcaacaaaactgatagactg ctagcaagactaataaagaagaaaagagagaagaatcaaatagatgcaataaaaaatgat aaaggggatatcaccaccgatcccacagaaatacaaactaccatcacagaatactacaaa cacctctacgcaaataaactagacaatctagaagaaatggataaattcctcgacacatac accatcccaagactaaaacaggaagaagttgaatctctgaatagaccaataacagactct gaaattgtggcaataatcaatagcttaccaaccaaaaaaagtccaggagcagatggattc acagccaaattctaccagacgtacaaggaggagctggtaccattccttctgaaactattc caatcaatagaaaaagagggaatcctccctaactcactttatgaggccagcatcatcctg ataccaaagcctggcagagacacaacaaaaaaagagagttttagaccaatatccttgatg aacgctgatgcaaaaatcctcaatacaatactggcaaaccgaatccagcagcacatcaaa aagcttatctaccatgatcaagtgggcttcatccctgggatgcaaggctggtttaacata tga >gi568815591f:107374061_107575359|GENSCAN_predicted_peptide_5|371_aa MQKLHPKVTNSKDQKVDKSTKMRKNQHKKAENAKNQNTSSPPKDHNASPAREQKWMGNQF DKLTEGDFRRRVITNSSKLQEHVLTQCKEAKSFEKKHEEKIREKRMKKNKQSLQEIQDYV KRPNLHLIAVPESDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRRPLRYSSRRTT PRQIIIRFTKVEMKEKMLRAAQEKDQPKWPQTPGLPRIRLAPVASDTRPAHKDTGSKPAL RVHIPSQPLWPHTPTDTKFRLLWQSQGPGPSQQNLASSGKPRVQAHHSRTQCQACPQDLM THTCLSGPVSRQALADPASRPALTHPASRKTLMAPGTKQAPTDCSLQWTQSPGLFQQTKA PGPPLQTQAPG >gi568815591f:107374061_107575359|GENSCAN_predicted_CDS_5|1116_bp atgcaaaaactccatccaaaggtcaccaacagcaaagaccaaaaggtagataaatccacg aagatgaggaaaaaccagcacaaaaaggctgaaaatgccaaaaaccagaatacctcttct cctccaaaggatcacaacgcctcaccagcaagggaacaaaagtggatggggaatcagttt gacaaattgacagaaggagacttcagaaggcgggtaataacaaactcctccaagctacag gagcatgttctaacccaatgcaaggaagctaagagctttgaaaaaaagcatgaagaaaag atcagagaaaaaagaatgaaaaagaacaaacaaagcctccaagaaatacaggactatgtg aaaagaccaaacctacatttgattgctgtacccgaaagcgatggggagaatggaaccaag ttggaaaacacacttcaagatattatccaggagaacttccctaacctagcaagacaggcc aacattcaaattcaggaaatacagagaagaccactaagatactcctcaagaagaacaacc ccaagacagataatcatcagattcaccaaggttgaaatgaaagaaaaaatgttaagagca gcccaagagaaagaccagcccaagtggccccaaacaccaggtctgccccgcatcaggttg gcacctgtggcctcagacaccaggccagcacacaaggacacaggctccaagcctgcccta cgggtccacattccaagtcagcccctgtggccccacactccaacagacacaaagttcagg ctcctctggcagagccagggtccaggtccatcccagcagaacctggcctcctctggcaaa cccagggtccaggctcatcacagcagaacccagtgccaggcctgcccccaagacctgatg actcacacctgcctcagtggcccagtctccaggcaagcccttgcagacccagcctccagg ccagcactcactcacccagcctctaggaagaccctcatggctccaggcacaaagcaagca cccacagactgcagccttcagtggacccagagtccaggcttatttcagcagaccaaggct ccaggaccacccttgcaaacccaagctccaggctaa >gi568815591f:107374061_107575359|GENSCAN_predicted_peptide_6|172_aa MGWVGGRRRDSASPPGRSRSAADDINPAPANMEGGGGSVAVAGLGARGSGAAAATVRELL QDGCYSDFLNEDFDVKTYTSQSIHQAVIAEQLAKLAQGISQLDRELHLQVVARHEDLLAQ ATGIESLEGVLQMMQTRIGALQGAVDRIKAKIVEPYNKIVARTAQLARLQLL >gi568815591f:107374061_107575359|GENSCAN_predicted_CDS_6|519_bp atgggctgggtgggcgggcggcgccgggattctgcgtcaccacctgggcggagccgttct gctgctgacgacatcaacccggcacctgccaacatggaaggtggcggcggcagcgtcgct gtagctggcctcggagctcgaggctctggagcggctgcagctacagtccgggaacttctg caggacgggtgttatagtgactttttaaacgaagactttgatgtaaagacttatacttct caatctattcatcaagctgtaattgctgaacaactagcaaaacttgcccaaggaatcagt cagttggacagagaactacacttacaggttgttgcaagacatgaagatttactggcacaa gcaactgggattgagtcgttggaaggtgttcttcagatgatgcagacgagaattggggct ttacagggagctgttgataggataaaagcaaaaattgttgaaccatacaataagatagtt gcccggactgcacaactagcaagacttcagctattgtaa