GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:17:21

Sequence gi568815596f:46518773_46724224 : 205452 bp : 40.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5922   6029  108  2  0   73   94   145 0.956  13.87
 1.02 Intr +  10183  10341  159  0  0   71   95   113 0.966   9.56
 1.03 Term +  11351  11503  153  1  0   30   54   104 0.315  -1.86
 1.04 PlyA +  11885  11890    6                               1.05

 2.00 Prom +  11909  11948   40                              -3.35
 2.01 Init +  24275  24416  142  1  1   91  115   266 0.970  27.94
 2.02 Intr +  24982  25040   59  2  2   97  109    66 0.243   7.28
 2.03 Intr +  29858  29939   82  1  1   56   94    39 0.181  -0.41
 2.04 Intr +  32103  32198   96  1  0   96   46    87 0.269   4.36
 2.05 Intr +  36250  36384  135  2  0   30   50   104 0.184   0.42
 2.06 Intr +  36983  37184  202  0  1   29   92   144 0.164   6.32
 2.07 Intr +  41331  41458  128  0  2   51   97    86 0.577   5.20
 2.08 Intr +  41752  41917  166  2  1   42   61   128 0.470   3.60
 2.09 Intr +  45745  45967  223  1  1   63   75    42 0.034  -2.39
 2.10 Intr +  57315  57479  165  2  0  116   91    80 0.773  10.34
 2.11 Intr +  57789  57884   96  2  0   37   80   124 0.521   5.79
 2.12 Term +  61785  61895  111  2  0  124   44    45 0.103   1.28
 2.13 PlyA +  64997  65002    6                               1.05

 3.12 PlyA -  65177  65172    6                               1.05
 3.11 Term -  89708  89451  258  2  0   -5   42   339 0.900  14.57
 3.10 Intr -  98545  98246  300  1  0  104   58   213 0.580  15.91
 3.09 Intr - 129632 129488  145  1  1   47   72    84 0.051   2.06
 3.08 Intr - 135716 135421  296  2  2   74   84   123 0.089   5.38
 3.07 Intr - 138140 138026  115  1  1  105   47    33 0.064   0.43
 3.06 Intr - 140131 139913  219  0  0   84   22   185 0.085   8.00
 3.05 Intr - 145893 145844   50  0  2   99   92    20 0.028   0.06
 3.04 Intr - 148497 148339  159  0  0   82   20   101 0.288   1.96
 3.03 Intr - 150732 150576  157  2  1   38   92   116 0.112   6.09
 3.02 Intr - 181080 180904  177  2  0   77   48   172 0.231  10.21
 3.01 Init - 192617 192559   59  2  2   70   63    64 0.582   2.93

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:46518773_46724224|GENSCAN_predicted_peptide_1|139_aa
MDRHHKQKESKTLDGENTAGKGLLVERGLRSACGEQNHTPRAAYGEVGPGWRSEPQAASR
GAKGRYEVSKEPWEIALQRQLAAQASIQPATTAHLQNAAQSPTTYIQSPLLFQPQDSLVP
TACETPGWRQERSTGLNRS

>gi568815596f:46518773_46724224|GENSCAN_predicted_CDS_1|420_bp
atggaccgtcatcataagcagaaagaaagtaagacactggatggggagaacactgcgggc
aaaggcctgctggtggaaaggggtctgaggtctgcgtgtggagagcagaaccatactcct
agagctgcctatggagaagttggcccaggctggaggtcagagccccaggcagcctccaga
ggggccaagggtaggtatgaagtcagcaaggagccctgggaaatagcccttcagaggcag
ctggctgcccaagcctccatccaacctgccacaactgcacatctccagaatgcagctcag
tctcctaccacctacatacagagccctctcctcttccagccccaggattccctggtccct
acagcatgtgagacaccaggatggcgccaggagagaagcactgggctgaacagaagctga

>gi568815596f:46518773_46724224|GENSCAN_predicted_peptide_2|534_aa
MAHGPGALMLKCVVVGDGAVGKTCLLMSYANDAFPEEYVPTVFDHYAVSVTVGGKQYLLG
LYDTAGQPRVPSLLSPVSRAWEEPQGSFQASAATGEGGGSGHRKVHYTTGQVHRVKLSAA
GSHTGRASLALEEWGRDWTRGGTEQADGGSASFGQGLRPTGESFGYSSTAEKWPVALCDK
SVGGGHGEDADLLTCVLTGRPECLLLSEQASMEVASRQAGVKTQVITQRNCWCVKLLDIP
AAEAGALLVPIASDPQGGLLCSPVFNWHLIARSEFGSVGLLSFAAKRVGFLGGDFCFIWI
LGSRDHVFLPYESMGFQWPGSVVSTSSSIRCLTVGQRWQIALYAFIWPAEWGSSIHIPHG
FVGHGLTPLCTLAASSGHRGRQHCSLHLSVVTRVFSLLLGCVLPLSIKRMCEDYDRLRPL
SYPMTDVFLICFSVVNPASFQNVKEEWVPELKEYAPNVPFLLIGTQIDLRDDPKTLARLN
DMKEKPICVEQGQKLAKEHLQPSALYSLPCASYPADVSTCGRHTAGTLLFRSVH

>gi568815596f:46518773_46724224|GENSCAN_predicted_CDS_2|1605_bp
atggctcacgggcccggcgcgctgatgctcaagtgcgtggtggtcggcgacggggcggtg
ggcaagacgtgcctactcatgagctatgccaacgacgccttcccggaggagtacgtgccc
accgtcttcgaccactacgcagtcagcgtcaccgtggggggcaagcagtacctcctagga
ctctatgacacggccggacagccccgtgttccatccttgctcagcccggtgagcagagcc
tgggaggagccccagggttccttccaggcatcagctgcaacaggagaaggtgggggatcg
ggccacagaaaggtccactacacaactgggcaagttcatcgtgtgaaattgagtgctgca
gggtcccacacaggaagagcctccttggccctggaggaatggggaagggactggacacga
ggagggacagagcaggcagatgggggctctgcttccttcggacagggacttcgtccaaca
ggcgagagctttgggtattcatccactgcagaaaagtggcctgtggccctctgtgataag
tcggtaggagggggacatggtgaggatgctgacctgctcacttgtgtcttgactgggaga
ccggaatgcctcctgttgtctgaacaagctagtatggaagtagcatctaggcaagctggt
gttaagacgcaagtaattactcagagaaattgctggtgtgtcaaactgcttgatattcca
gctgcagaagcaggcgctcttctggttcccatagcttcagaccctcagggaggtctgctg
tgcagtccggtgtttaactggcacctcattgctcgctccgaatttggctccgtgggcctg
ttaagctttgcagcaaagcgagttggatttctgggaggagatttctgtttcatctggatc
ctggggtcacgtgatcacgttttccttccatacgagagcatgggtttccagtggccagga
tctgttgtgtccactagctccagcatccgctgtcttactgttggacagagatggcagatt
gcactgtatgcgtttatttggccagcagagtggggctcaagtattcacattccacatggg
tttgttggtcatggcttaactcctctgtgcactctggctgcctcatctggacatagagga
aggcagcactgcagccttcatctgagtgtagttaccagggtcttctctttgctgcttggc
tgcgtgttgcctttgtctattaaaaggatgtgtgaagactatgaccgtctgaggccttta
tcttacccaatgaccgatgtcttccttatatgcttctcggtggtaaatccagcctcattt
caaaatgtgaaagaggagtgggtaccggaacttaaggaatacgcaccaaatgtacccttt
ttattaataggaactcagattgatctccgagatgaccccaaaactttagcaagactgaat
gatatgaaagaaaaacctatatgtgtggaacaaggacagaaactagcaaaagagcatttg
caaccatctgctctgtattccctcccctgtgcctcttacccagcagatgtcagcacatgt
ggacgccacacagcagggactctgctctttcgttcagttcactga

>gi568815596f:46518773_46724224|GENSCAN_predicted_peptide_3|644_aa
MHDIDTHKENANSNHSEISHLCNCFSLRWPPRDPPRPRSRMPSYWPRTTPFPLGTPRSSN
QKNNSTEMHREPTKRKLPLLPLLQQLIIQAQVADAKQEEVLFSLPVTYLELIYSASQPSG
QHGCASITASWGLWRRGCSWLTQFRLPFSHVVARDLQASRPVFTIKNSVMARAGLQTVEF
SSGSMTTSHIDRDLQTITTDSVSLWPNPPGSQGSRELPGAVQMGQPHGTHGSMEKPGDGA
EGANGNYSADVCGNIVKGKGVISRHFSIMVASEGSEFSSQQFTKVLTWPWNAGKLQRNQR
PEHSSLGLLRLHVLISSHRNPSLPLEVNLLVGFHLPHPSYNLKTQETRVEMQGTKAAAAI
IQTLHTGQGPLGTGPRQGVASRQEGNGICQIMDASDLRKATAKGEMEMQTRTEGADRGGS
KGRPPFSLLILQPPFTNSSKYFAMLQGVTKSEKQAVIQLKRPLNSHFSHTILPHDGSPSR
NSSSSSRLKTTTLQLGSIPLPGYVNPLPASKPILPPTPNLESRCPFCPPLPFCDWLLGVG
LRPTALRLRVGLGRPSYLMGGDKNINIDEKLIPFLMNDVERFRTSVEETTADVVEIAREL
ELEQEPEDVTELLQHYDKTGTDEELLLDEKENGLLRWNLLVKMP

>gi568815596f:46518773_46724224|GENSCAN_predicted_CDS_3|1935_bp
atgcatgacatcgatactcacaaggaaaatgcaaattcaaaccacagtgagatatcccac
ctgtgtaactgcttcagtctccggtggcctccccgcgacccaccacgaccccgatcacgg
atgccctcctactggccccggacgacccccttccccttaggaaccccacgttcctccaac
cagaaaaacaattccacagaaatgcatcgagagcctactaagcgcaagctcccacttcta
cccttgctccagcagctcataatccaggcacaggttgccgatgcaaagcaggaagaagtg
ctcttttctttgccagtaacctatctggaactcatctacagtgccagccagccatcaggg
cagcacgggtgtgcctccatcacagcttcttggggcctttggagaaggggctgcagttgg
ctgactcagttcaggttaccattcagccacgtggttgcaagggacctgcaggccagtaga
cctgtcttcacaataaagaattctgtcatggccagagcaggtttacagaccgtggagttt
tcaagtggctcaatgaccacctcccatatcgacagagacctgcaaactattacaacagac
agtgtttcactttggccaaacccaccaggaagccaaggatcaagggagctccctggtgca
gtccagatgggccagcctcatgggacacacggcagtatggagaagcctggagacggagct
gaaggggcaaatggaaattattcagcagatgtatgtggcaacattgtaaagggtaaagga
gtcatcagcagacatttcagcatcatggtcgcctctgaagggtcagaattcagttctcag
caatttacaaaagtcttgacttggccctggaatgctggcaagctccagcggaatcagaga
cctgagcacagttccctagggctgttgaggcttcatgtgctcatttcctcgcacagaaac
ccctccttaccattggaggtaaatctccttgttgggttccacctgccacatcccagctac
aacctcaagacccaagaaaccagggttgaaatgcaaggcaccaaggctgctgcagccatc
atccaaaccctccacacaggccaggggccgctaggcactggcccaaggcagggtgtagca
tccagacaagaggggaatggaatttgccagataatggatgccagtgatttgaggaaggca
acagccaagggagaaatggaaatgcaaaccagaacagaaggtgctgacaggggtgggagt
aaaggaaggcctcctttcagtttactaattctgcagccgcccttcacaaactccagcaaa
tattttgcgatgttgcaaggtgttacgaaaagtgaaaagcaagctgtgattcagctgaag
cggccccttaactcacatttttcgcacaccatccttccccacgacggttcccctagccgc
aacagcagcagcagcagcaggctgaaaacaacaacactacaacttggaagtattcccctt
ccgggctacgtcaacccacttccggcttctaagcccatactcccgcctaccccaaacctg
gaaagccgctgccccttttgcccgcccctccctttctgcgactggctgttaggcgtgggt
ctccgccccacagccctgcgcctgcgcgtgggcctgggccgcccgtcgtacctgatggga
ggagataaaaatatcaacattgacgagaagctgattccattcctcatgaatgatgttgag
aggttcaggacttcagtggaggaaaccactgcagatgtggtggaaatagccagagaacta
gaattagaacaggagcctgaagatgtgactgaattgctgcaacattatgataaaactgga
acagatgaggaattacttctggatgagaaagaaaatggtctcctgagatggaatctactg
gtgaagatgccatga