GENSCAN 1.0     Date run: 7-Nov-116     Time: 01:26:49

Sequence gi568815596r:105261286_105486516 : 225231 bp : 43.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.15 Intr -   8420   7987  434  0  2   60  115   683 0.669  61.57
 1.14 Intr -  11729  11570  160  2  1   98   85   214 0.997  21.66
 1.13 Intr -  12405  12259  147  0  0  103   99   107 0.999  13.73
 1.12 Intr -  14418  14275  144  0  0   92   68   233 0.999  22.18
 1.11 Intr -  16386  16329   58  2  1   89  109    18 0.998   2.89
 1.10 Intr -  19438  19097  342  0  0   24   87   509 0.937  38.95
 1.09 Intr -  23113  23031   83  1  2  105   91    59 0.728   6.34
 1.08 Intr -  35225  35071  155  0  2   84   72    80 0.896   5.79
 1.07 Intr -  37420  37226  195  2  0   97   73   229 0.950  21.69
 1.06 Intr -  38277  38214   64  0  1   30   72    38 0.016  -5.31
 1.05 Intr -  47033  46198  836  0  2  105   54  1271 0.008 116.51
 1.04 Intr -  62025  62007   19  0  1  107   90    29 0.004   1.18
 1.03 Intr -  67567  67377  191  2  2   50   45   158 0.024   7.00
 1.02 Intr -  68469  68340  130  0  1   54  117    65 0.915   6.27
 1.01 Init -  68994  68932   63  0  0  109   88    75 0.996   8.81
 1.00 Prom -  70642  70603   40                              -7.46

 2.00 Prom +  72002  72041   40                              -4.96
 2.01 Init +  76189  76445  257  0  2   83   94   172 0.257  12.00
 2.02 Intr +  81563  81938  376  2  1   34  115   145 0.451   6.82
 2.03 Term +  95983  95991    9  0  0  135   28     0 0.030  -3.11
 2.04 PlyA +  96295  96300    6                               1.05

 3.09 PlyA -  97358  97353    6                               1.05
 3.08 Term - 100149  99998  152  1  2   94   38   153 0.933   8.97
 3.07 Intr - 102186 102000  187  0  1  123  101   252 0.999  29.26
 3.06 Intr - 106454 106285  170  0  2   63   65   238 0.808  18.67
 3.05 Intr - 112448 112274  175  2  1  122   71   252 0.990  26.41
 3.04 Intr - 116524 116415  110  2  2   95   78    73 0.938   7.00
 3.03 Intr - 116832 116711  122  2  2   56   86    55 0.766   2.24
 3.02 Intr - 117272 117095  178  0  1   63   59    66 0.677   0.18
 3.01 Init - 118445 118361   85  2  1   73   98   -23 0.329  -1.66
 3.00 Prom - 119524 119485   40                              -3.66

 4.03 PlyA - 121407 121402    6                               1.05
 4.02 Term - 124430 124236  195  2  0   41   42   145 0.670   2.61
 4.01 Init - 125231 125076  156  2  0   82   77   341 0.634  32.31
 4.00 Prom - 126367 126328   40                              -4.46

 5.00 Prom + 126498 126537   40                              -4.36
 5.01 Init + 126622 126750  129  0  0   57   89    95 0.349   6.65
 5.02 Intr + 137127 137297  171  2  0   42   81   124 0.933   7.24
 5.03 Intr + 138730 138801   72  0  0   85   97    11 0.525   1.30
 5.04 Intr + 149098 149131   34  0  1   88   78    14 0.041  -1.80
 5.05 Intr + 157944 157972   29  1  2   82  102    70 0.732   5.63
 5.06 Intr + 161700 161798   99  2  0  115   32    32 0.522   0.61
 5.07 Term + 162305 162487  183  1  0   77   50   105 0.899   3.14
 5.08 PlyA + 164041 164046    6                               1.05

 6.08 PlyA - 166363 166358    6                               1.05
 6.07 Term - 170672 170603   70  1  1   66   49   116 0.658   2.91
 6.06 Intr - 172987 172885  103  2  1   86   72    38 0.671   1.23
 6.05 Intr - 177231 177114  118  0  1  -29  110   132 0.319   3.74
 6.04 Intr - 180938 180842   97  1  1   92   97    43 0.290   5.61
 6.03 Intr - 192631 192518  114  0  0   65   98   111 0.518   9.36
 6.02 Intr - 206121 206058   64  1  1   52  121    29 0.243   0.38
 6.01 Init - 207952 207901   52  1  1   97  105     8 0.351   4.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 132564 132985  422  1  2   68   37   128 0.814   1.15
S.002 Init + 136220 136276   57  1  0   73  101    32 0.951   3.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:105261286_105486516|GENSCAN_predicted_peptide_1|1007_aa
MGLVGGLRIPTMPSSPGGAPARLAAAPRDGKRRLRGRASPGAAGRSGAAGPQAAGRRGTA
GAGAGLLSDPQEASYLAAKNGGSQITQQALLHVDCTRSCGQNMSEQSLITDEVLMGKHLV
YQEGRQLREADKARADQPVDMMSIKAFTLVSAVERELLMGDKERVNIECVECCGRDLYVG
TNDCFVYHFLLEERPVPAGPATFTATKQLQRHLGFKKPVNELRAASALNRLLVLCDNSIS
LVNMLNLEPVPSGARIKGAATFALNENPVSGDPFCVEVCIISVKRRTIQMFLVYEDRVQI
VKEVSTAEQPLAVAVDGHFLCLALTTQYIIHNYSTGVSQDLFPYCSEERPPIVKRIGRQE
FLLAGPGGLGEEDAFVRLIFLGGLGDCVLKLESEIGAVQNKSSVTLGVCGKTEFSGWAMF
SLKQVNAVTLFDLQGMFATVAGISQRAPVHWSENVIGAAVSFPYVIALDDEFITVHSMLD
QQQKQTLPFKEGHILQDFEGRVIVATSKGVYILVPLPLEKQIQDLLASRRVEEALVLAKG
ARRNIPKEKFQVMYRRILQQAGFIQFAQLQFLEAKELFRSGQLDVRELISLYPFLLPTSS
SFTRSHPPLHEYADLNQLTQGDQEKMAKCKRFLMSYLNEVRSTEVANGYKEDIDTALLKL
YAEADHDSLLDLLVTENFCLLTDSAAWLEKHKKYFALGLLYHYNNQDAAAVQLWVNIVNG
DVQDSTRSDLYEYIVDFLTYCLDEELVWAYADWVLQKSEEVGVQVFTKRPLDEQQKNSFN
PDDIINCLKKYPKALVKYLEHLVIDKRLQKEEYHTHLAVLYLEEVLLQRASASGKGAEAT
ETQAKLRRLLQKSDLYRVHFLLERLQGAGLPMESAILHGKLGEHEKALHILVHELQDFAA
AEDYCLWCSEGRDPPHRQQLFHTLLAIYLHAGPTAHELAVAAVDLLNRHATEFDAAQVLQ
MLPDTWSVQLLCPFLMGAMRDSIHARRTMQVALGLARSENLIYTYDK
>gi568815596r:105261286_105486516|GENSCAN_predicted_CDS_1|3021_bp
atgggcctcgtgggagggctgcgcatcccaaccatgccttcgtctcctggtggggcaccg
gcgcgcctggcggccgccccacgtgacgggaagcggcggctgcggggtcgggccagccca
ggagccgcgggccggagcggggcggcggggccccaggccgcggggcggcgcgggacggcg
ggcgccggcgccggactgttgagtgaccctcaggaagcctcctacctggctgctaaaaat
gggggctcacagatcacacagcaggccttactgcacgtggactgcacgcgcagctgtggt
caaaacatgtctgagcagagtttaatcactgatgaagtgctcatgggcaagcaccttgtc
tatcaggaaggaaggcagctgcgggaagctgacaaagccagagcagatcagccagtagac
atgatgagcatcaaagcctttacgcttgtctctgctgtggagcgggagctgctgatgggc
gacaaggagcgcgtcaacatagagtgcgtggagtgctgcggcagggacctctacgtgggc
accaacgactgcttcgtctaccacttcctgttggaggagaggccagtgcctgctgggcca
gccacgttcactgccaccaaacagctgcagagacacttgggcttcaagaagcccgtgaac
gagctgcgtgcggcctcagcactcaacaggctgctggtgctgtgtgacaactccatcagc
ctggtcaacatgctgaacctcgagccagtgccttcgggggcccgcatcaagggggcagcc
acgtttgcactgaacgagaaccctgtgagtggggaccccttctgtgtagaagtttgcatc
atctctgtcaaacgcagaaccatccagatgtttctggtgtacgaggaccgggtgcagatc
gtcaaggaggtgtcgactgccgagcagcccctcgctgtggctgtggacggccacttcctg
tgtctggctctgaccactcagtacatcatccacaattacagcacaggcgtctcccaggac
ctgtttccctactgcagtgaggagaggccgccgatcgtcaagaggatagggagacaggag
ttcctgctggcgggccccggagggctgggtgaggaggatgcatttgtgcgcctgatcttt
ttgggtggcctcggtgactgcgtgttgaaactggagagtgagattggcgctgtgcagaat
aaatcctcagtgacattgggcgtgtgtggcaagacagagttctctggttgggcgatgttc
agccttaagcaggtaaatgcagtgaccctctttgacctccagggcatgtttgccacagtc
gcagggatatcccagcgcgcccccgtgcactggtcggagaatgtgattggggcggctgtg
tcctttccatacgtcatagcgctcgatgacgaattcatcacagtccacagcatgttggat
cagcaacagaagcagacgctgccctttaaggagggccatatcctacaggactttgaagga
agagtgatcgttgccacaagtaaaggagtttacatcttggttccattacctttggaaaaa
caaatacaggatcttctagcaagccgcagagtagaagaggctttggttttagcaaaagga
gcccggaggaacattccaaaggaaaaatttcaggtaatgtacagaaggattctgcagcag
gcgggatttatacagtttgcacaacttcagttcctggaagctaaagagctcttcagaagc
ggccagcttgatgtccgggagctgatctctctctaccccttcctgttgcccacctcctcc
tccttcacccggtcccaccctcctcttcatgagtacgcagacctgaaccagctgacccag
ggggaccaggagaagatggccaagtgcaaacgcttcctcatgagctacctgaacgaggtc
cgcagcacagaggtagcaaatggctacaaggaggacatcgacacagccttgctcaaactg
tatgcagaggctgaccacgacagcctgctggacctcctggtcactgagaacttctgtctt
ctgacggacagtgctgcctggctagagaagcacaaaaagtattttgcacttggactgctc
tatcattataataaccaagatgctgctgcagttcagttgtgggtgaacattgtgaatggc
gatgtccaggactccacacgctcagacctgtatgaatacatcgtggattttcttacctac
tgcttagacgaggaactagtgtgggcctatgctgattgggtcctgcagaaaagtgaagag
gtcggagttcaggttttcaccaagagacctttggatgaacagcagaagaacagttttaat
ccagacgacattatcaattgccttaaaaaataccctaaagcccttgtgaagtatctggaa
catcttgtgatagacaagagactgcagaaagaagagtatcacacccacttagctgtgctg
tacctggaagaggtgctgctgcagagggcctccgccagtggcaagggtgcagaggccacc
gagacgcaggccaagctgcggcggctgctccagaaatctgatttataccgagtccacttt
cttctcgagaggctgcagggagctggcctgcccatggagagcgccatcctgcacgggaag
ctgggcgagcatgagaaggcgctgcatatcctggtgcacgagctgcaggactttgcagcg
gccgaggactactgcctgtggtgctccgagggccgagacccaccccaccgccagcaactc
tttcacacgctgctggccatctacctgcatgctggccccactgcccacgagctggccgtg
gctgccgtggacctgctgaaccgccacgccaccgaatttgatgcagcccaggtgctgcag
atgctgcctgacacctggtcagtgcagctcctctgcccattcctgatgggggccatgagg
gacagcatccatgccaggaggaccatgcaggtggctctcggcctggccaggtccgaaaac
ttaatctacacctacgataag
>gi568815596r:105261286_105486516|GENSCAN_predicted_peptide_2|213_aa
MRRLRVTTHASLRPSTSLPQRFLRGALWVADWGLLATTMAGDVGGRSCTDSELLLHPELL
SQEFLLLTLEQVGRAGSEGGRVGLPRSSTVDGLRKRPLIVFDGSSTSTSIKVKKTENGDN
DRLKPPPQASFTSNAFRKLSNSSSSVSPLILSSNLPVNNKTEHNNNDAKQNHDLTHRKSP
SGPVKSPPLSPVGTTPVKLKRAAPKEEAEAMIV
>gi568815596r:105261286_105486516|GENSCAN_predicted_CDS_2|642_bp
atgcggcgactcagagtgacgacacacgcgagtctccgcccgagtacgtcacttccgcaa
cgcttccttcgcggggctttgtgggtagccgactggggtctcctggcgacgaccatggcg
ggggatgtgggcggtcgcagctgcacggactcggaactgctgctgcacccggagctgctg
tcccaggagttccttctcctcactctggagcaggttgggcgcgccggatcggagggtggg
cgggtgggccttcccaggagtagcactgtagatgggttaaggaaaagacccctcatcgta
tttgatggaagttcaacaagtacaagcataaaagtgaaaaagacagagaatggagataat
gatcgactgaagcctcccccgcaggcaagctttaccagtaatgcctttagaaaattatca
aattcctcttcgagtgtttcacccctaattttgtcttccaatttgcctgtgaacaataaa
acggaacacaataataatgacgctaaacagaaccatgacttaacgcataggaaaagtcct
tcaggccctgtgaagtcgccaccattgtcccctgttggaactactccagtgaagttaaag
agagctgctcctaaagaagaggcagaggccatgattgtataa
>gi568815596r:105261286_105486516|GENSCAN_predicted_peptide_3|392_aa
MEMRKRGPEQRVRLESELGSICKGADVPGQNLDLCMPQPEWDGEHFQVSTLHQPEGKGQH
LAGAKAKWFPGDIADQVVLRECGLHLARHADVLFPPSGDLMTGALAVVTSSGFAFLLTLE
ISLSVFLEGLLSQPDSDSWVMENILQLFPKRAVLYEVARLHRGTEDLSYKDRHWHEACFH
CSQCRNSLVDKPFAAKEDQLLCTDCYSNEYSSKCQECKKTIMPGTRKMEYKGSSWHETCF
ICHRCQQPIGTKSFIPKDNQNFCVPCYEKQHAMQCVQCKKPITTGGVTYREQPWHKECFV
CTACRKQLSGQRFTARDDFAYCLNCFCDLYAKKCAGCTNPISGLGGTKYISFEERQWHND
CFNCKKCSLSLVGRGFLTERDDILCPDCGKDI
>gi568815596r:105261286_105486516|GENSCAN_predicted_CDS_3|1179_bp
atggagatgaggaagaggggaccagagcagagggtcaggttggaaagcgagttggggtca
atctgcaaaggggctgacgtgccaggacaaaacctggatctctgcatgccccaacccgaa
tgggatggggagcactttcaggtgtccactctacaccagcctgaagggaaagggcagcat
ctggcaggagccaaggcgaaatggttcccaggggatatagcagaccaggttgtcctgaga
gaatgcggccttcaccttgcgaggcatgctgatgtgctctttcctccatctggggactta
atgaccggagcactggctgtggtgacatctagtggctttgctttcctgttgaccttggag
atttctctctcagtttttcttgaaggtctcttgtcccagccagacagtgactcttgggtc
atggaaaacattctgcagcttttccccaaacgagcagtcctctatgaggtggctcgcctg
cacagggggacagaggacttgtcttacaaggaccggcactggcatgaagcctgtttccac
tgctcgcagtgcagaaactcactggtggacaagccctttgctgccaaggaggaccagctg
ctctgtacagactgctattccaacgagtactcatccaagtgccaggaatgcaagaagacc
atcatgccaggtacccgcaagatggagtacaagggcagcagctggcatgagacctgcttc
atctgccaccgctgccagcagccaattggaaccaagagtttcatccccaaagacaatcag
aatttctgtgtgccctgctatgagaaacaacatgccatgcagtgcgttcagtgcaaaaag
cccatcaccacgggaggggtcacttaccgggagcagccctggcacaaggagtgcttcgtg
tgcaccgcctgcaggaagcagctgtctgggcagcgcttcacagctcgcgatgactttgcc
tactgcctgaactgcttctgtgacttgtatgccaagaagtgtgctgggtgcaccaacccc
atcagcggacttggtggcacaaaatacatctcctttgaggaacggcagtggcataacgac
tgctttaactgtaagaagtgctccctctcactggtggggcgtggcttcctcacagagagg
gacgacatcctgtgccccgactgtgggaaagacatctga
>gi568815596r:105261286_105486516|GENSCAN_predicted_peptide_4|116_aa
MTERFDCHHCNESLFGKKYILREESPYCVVCFETLFANTCEECGKPIGCDCKTLVPTWIA
EGHPGGPIHSSELSDTFTRKRLLKHSSETPHVAVILSAVKLATKWDSFGISKGIPN
>gi568815596r:105261286_105486516|GENSCAN_predicted_CDS_4|351_bp
atgactgagcgctttgactgccaccattgcaacgaatctctctttggcaagaagtacatc
ctgcgggaggagagcccctactgcgtggtgtgctttgagaccctgttcgccaacacctgc
gaggagtgtgggaagcccatcggctgtgactgcaagactcttgttccaacctggattgct
gaaggacaccctgggggacccatccatagctctgaactctctgataccttcacccgcaaa
cggcttctaaaacactccagtgaaacccctcatgtggctgtcatcctgagtgccgtgaaa
ctggctacaaaatgggattcctttggcatctctaaaggaattcccaactga
>gi568815596r:105261286_105486516|GENSCAN_predicted_peptide_5|238_aa
MKYYAATKKNEIMSFAGTRMELEAIILSKLMQEQKAKYCMFSLGNGPDSPAQTTVPGQPA
RESGLEAGGQRHLPAAAGWDPRPAFTQARPVPARMAVPHLASWVSIMESQNLRIKGLHHL
SFHSRVHDHSIGSWRGFSELVVFGYLRRHGNPLQLPHSGKQAEDACIGPDSGSGIKRQVF
TSITLPDFKLYYKATVTKTAWYWYQNRYIDQWNITEASETTPHIYNHLIFDKPDKQQK
>gi568815596r:105261286_105486516|GENSCAN_predicted_CDS_5|717_bp
atgaaatactatgcagccacaaaaaagaatgagatcatgtcttttgcaggaacacggatg
gagctggaggccattatccttagcaaactaatgcaggaacagaaagccaaatactgcatg
ttctcacttggaaatggcccggactcccccgcgcagaccaccgtgccaggacagcccgct
cgggagtcgggcctggaagcaggcggacagcgtcacctccccgcagccgccggctgggac
ccgcggccagcctttacccaggctcgcccggtccctgcccgcatggcggtgccccacctg
gcctcatgggtctccatcatggaatcacagaacttgagaataaaggggcttcatcacctt
agcttccactcgagggtccatgaccattctattggcagttggagaggtttttcggaactg
gtggtgtttggttacctcagacggcatgggaacccattacaactccctcatagtggaaag
caagctgaggatgcttgcattgggcctgactcaggaagtggtatcaagcgtcaggtattt
acttccatcacactacctgacttcaaactatattacaaggctacagtaacaaaaacagca
tggtactggtaccaaaacagatatatagaccagtggaacataacagaggcctcagaaaca
acaccacacatctacaaccatctgatctttgacaaacctgacaaacaacagaaatag
>gi568815596r:105261286_105486516|GENSCAN_predicted_peptide_6|205_aa
MGSTGNTGPVCPRGAPRVELQALFLLYPEYPDYPGTQEREVSEVSNGASPDSHHTATHGE
NLRFGIAGATYSGCTTSYRAETFWNEDLMIYFQTWQLREFLYSQLLPRKRSWLAAAQKVG
AIQPGSPPAQQLSLLIGLEDRRQNPKTTVLSDPVGIRLQLQKSNPVAICKGQHGLNDINE
CFSSSSSTSSTLAADELRQDPGLLP
>gi568815596r:105261286_105486516|GENSCAN_predicted_CDS_6|618_bp
atgggaagcactggcaacactgggcctgtttgccctcgcggagcaccccgagttgagctc
caagctctgtttcttctctaccctgagtatccagactaccctggcactcaagaaagggaa
gtctcagaagtttctaatggagcatcccctgactcccaccatactgctacccatggagag
aatttacggtttgggattgctggtgcaacgtacagcggctgcacaacaagctatagagca
gagaccttctggaatgaagatcttatgatctactttcagacatggcagctcagagaattt
ctttacagccagctcttgcccagaaagcgcagctggctggcagctgcacagaaggttggg
gccatccaaccaggcagtccgcctgcacagcagctgtccctgctcatcgggctggaggac
agaagacagaaccctaaaaccacagtactttccgatccagttggcattagacttcaattg
caaaagagcaaccctgttgccatatgcaagggacagcatggtttgaatgacataaatgaa
tgcttcagcagcagcagcagcacatcctccactctggctgctgatgagctccggcaggat
cctggcctgctgccctga