GENSCAN 1.0    Date run: 5-Nov-116    Time: 18:47:58

Sequence gi568815596r:147835513_148075958 : 240446 bp : 35.63% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1607   1769  163  1  1   73   28   107 0.243   1.83
 1.02 Term +   2359   2576  218  2  2  105   48    99 0.807   3.92
 1.03 PlyA +   2717   2722    6                               1.05

 2.07 PlyA -   2769   2764    6                               1.05
 2.06 Term -   6229   6145   85  0  1  112   37    78 0.336   1.25
 2.05 Intr -   8328   8150  179  0  2   64   57    44 0.285  -3.30
 2.04 Intr -   8957   8817  141  2  0   24   64   141 0.783   4.93
 2.03 Intr -   9283   9061  223  0  1   -2    2   358 0.670  15.41
 2.02 Intr -  32820  32542  279  2  0   85    4   133 0.018   0.27
 2.01 Init -  36885  36842   44  0  2   51  121    34 0.625   2.94
 2.00 Prom -  42827  42788   40                              -1.85

 3.00 Prom +  46492  46531   40                              -4.65
 3.01 Init +  54456  54519   64  2  1   41   67    59 0.600   0.46
 3.02 Intr +  60789  60996  208  1  1  108  110    88 0.956  10.21
 3.03 Intr +  63946  64055  110  1  2  103   94   -34 0.852  -2.29
 3.04 Intr +  64232  64386  155  0  2  104  108   100 0.996  12.47
 3.05 Intr +  79679  79822  144  1  0   69   91    50 0.783   2.96
 3.06 Intr +  81771  81914  144  2  0   88   93   132 0.996  13.26
 3.07 Intr +  82935  83080  146  2  2  110   74    33 0.912   2.26
 3.08 Intr +  84718  84832  115  1  1   52  110    68 0.993   4.93
 3.09 Intr +  87461  87599  139  1  1   95   91    84 0.999   8.52
 3.10 Intr +  90519  90649  131  1  2  120   87    83 0.998  10.89
 3.11 Term +  91568  91762  195  1  0  107   45   103 0.983   4.33
 3.12 PlyA +  92408  92413    6                               1.05

 4.18 PlyA -  93706  93701    6                               1.05
 4.17 Term -  96129  95978  152  1  2   18   49   104 0.082  -3.41
 4.16 Intr -  97059  97038   22  0  1  104   93     0 0.105  -1.70
 4.15 Intr - 100186 100037  150  1  0  114   48   104 0.400   8.44
 4.14 Intr - 102701 102634   68  0  2   56  116    54 0.978   2.71
 4.13 Intr - 102881 102786   96  0  0  103   81    20 0.767   1.86
 4.12 Intr - 103736 103628  109  2  1  116   81    76 0.998   8.64
 4.11 Intr - 108010 107924   87  1  0  102   10    86 0.714   1.45
 4.10 Intr - 112625 112539   87  2  0   94   70    43 0.647   2.35
 4.09 Intr - 117012 116861  152  1  2   86  111    56 0.986   6.66
 4.08 Intr - 119883 119835   49  0  1   87   94    22 0.529   0.03
 4.07 Intr - 122871 122786   86  1  2   56   78   102 0.968   4.62
 4.06 Intr - 123354 123279   76  0  1   35  115    66 0.607   2.17
 4.05 Intr - 124261 124225   37  0  1   94   75    18 0.435  -1.55
 4.04 Intr - 126665 126598   68  2  2   80   51   109 0.474   3.18
 4.03 Intr - 137317 137227   91  0  1   74   53    82 0.006   2.38
 4.02 Intr - 142227 142085  143  0  2   54   20   188 0.072   6.83
 4.01 Init - 142457 142452    6  2  0   92   72     0 0.961  -0.17
 4.00 Prom - 145732 145693   40                              -4.15

 5.00 Prom + 147048 147087   40                              -3.45
 5.01 Init + 148271 148421  151  1  1   54   42   138 0.348   6.05
 5.02 Term + 167984 168285  302  0  2   28   42   158 0.186  -0.40
 5.03 PlyA + 168511 168516    6                               1.05

 6.00 Prom + 169569 169608   40                              -3.75
 6.01 Sngl + 180721 180915  195  0  0   65   44   194 0.865   7.61
 6.02 PlyA + 181249 181254    6                               1.05

 7.06 PlyA - 183015 183010    6                               1.05
 7.05 Term - 184778 184611  168  2  0   60   48    97 0.793  -0.20
 7.04 Intr - 186149 185845  305  0  2    4   60   348 0.080  18.98
 7.03 Intr - 186346 186255   92  0  2  101   32    37 0.051  -1.88
 7.02 Intr - 198319 198188  132  0  0   68   64    77 0.545   2.14
 7.01 Intr - 205840 205681  160  2  1   65  116   115 0.917  10.12

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 127415 127308  108  2  0   73   63   107 0.822   4.97
S.002 Term - 142227 142081  147  0  0   54   42   193 0.898   8.22

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_1|126_aa
LTCLNICQEAMDPPTDVSSLIQRSPTFLTPGTGFVENNFSMGVLDNFRMKLFDSGENLAL
HVREKKSPQLGLPLCSNVRFCKLICFCKLICFCLPFLPQKEEEEEEEEEEENNHSLLKGQ
PNSLMF

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_1|381_bp
ctcacttgcttaaatatctgccaagaagcaatggatcctcctactgatgtcagttctcta
atccagcggtccccaacctttttgacaccagggaccggttttgtggaaaacaatttttcc
atgggggtattggataatttcaggatgaaactgtttgactcaggtgaaaaccttgcttta
catgtcagagaaaagaagagccctcagctgggattgccattatgctccaatgttcggttc
tgtaaactcatctgcttctgcaaactcatctgcttctgcttgcccttcctgcctcaaaag
gaggaggaggaggaagaagaggaagaggaagagaacaaccactctcttctcaaaggccaa
cctaactctctcatgttctga

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_2|316_aa
MSVEKYSENMKHNIREFSALSINIIILTASKLLRMLATLYRILFPIKGNLVTSYHSSKVQ
TNATTSLYYSSHFSERELGADSTSIEPLTENTEMNEKNLLSSNVKREKLRRRRGGGGSGV
RCRPDSLGTDWVRCPDRPSTQTPKVRGPLAAELAAVAARRLRRGEGAAGGSGGGGAGGGQ
VLLEARRSGGTLGGSGGGGGGGSGGRAQRACRAGSARRLSGKRGGSGGGLLHTRAVLVTR
KEKEKTHGGRAVHPLRIFTPFITMRNLNPFPPFLLHTKHCVENCFARNRKMTENLWSMED
VKFPLNEGLRSFRAAF

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_2|951_bp
atgtctgttgaaaagtattctgaaaatatgaagcacaacataagggagttttcagcatta
tctataaatattatcatcttgactgcctcaaagctcttgcgtatgctagccactctgtac
agaatcctcttccccatcaaagggaacttggtgacatcctatcattcttccaaggtccag
acaaatgccactacctccttgtattactcttcacatttcagtgaacgtgaactgggcgcc
gattccaccagcattgagccactcacggagaatacagagatgaatgagaaaaacttgtta
agttctaacgtgaagagagagaagctgaggcgacgaagaggcggcggcggcagcggcgtc
cgctgccgcccggacagtctgggaactgactgggtccgatgtccggaccgaccttcaacc
cagacacctaaagtccgaggaccactggcggccgagctcgcggcggtggcggcgaggcgg
ctgcggcgcggggaaggggcggcgggcggcagcggcggcggcggtgcaggcggcggccaa
gttctgctggaagccaggcgcagcggaggaacgctcggcggcagcggcggcggcggcggc
ggcggcagcggcggccgggcgcagcgcgcgtgccgggccgggagcgcgaggagattgtcg
gggaagcgaggcgggagcgggggagggcttctgcacacgcgtgctgtcttggtgactaga
aaagaaaaagaaaaaactcacggaggaagggctgtccacccgctcaggatattcacgcca
tttatcacgatgcgcaacttgaatccattcccgccgttccttttacacacaaagcattgt
gttgaaaactgttttgcacgtaataggaagatgactgaaaacctttggagtatggaggat
gttaagttccccttaaatgaaggacttagaagctttcgtgctgctttctag

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_3|516_aa
MVLEKNLEIESICLILQMRKQGAILGRSETQECLFFNANWEKDRTNQTGVEPCYGDKDKR
RHCFATWKNISGSIEIVKQGCWLDDINCYDRTDCVEKKDSPEVYFCCCEGNMCNEKFSYF
PEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIVICAFWVYRHHKMAYPPVLVPTQD
PGPPPPSPLLGLKPLQLLEVKARGRFGCVWKAQLLNEYVAVKIFPIQDKQSWQNEYEVYS
LPGMKHENILQFIGAEKRGTSVDVDLWLITAFHEKGSLSDFLKANVVSWNELCHIAETMA
RGLAYLHEDIPGLKDGHKPAISHRDIKSKNVLLKNNLTACIADFGLALKFEAGKSAGDTH
GQVGTRRYMAPEVLEGAINFQRDAFLRIDMYAMGLVLWELASRCTAADGPVDEYMLPFEE
EIGQHPSLEDMQEVVVHKKKRPVLRDYWQKHAGMAMLCETIEECWDHDAEARLSAGCVGE
RITQMQRLTNIITTEDIVTVVTMVTNVDFPPKESSL

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_3|1551_bp
atggttctggaaaagaatttggaaattgaatcaatctgcttgattttacagatgcggaaa
caaggtgctatacttggtagatcagaaactcaggagtgtcttttctttaatgctaattgg
gaaaaagacagaaccaatcaaactggtgttgaaccgtgttatggtgacaaagataaacgg
cggcattgttttgctacctggaagaatatttctggttccattgaaatagtgaaacaaggt
tgttggctggatgatatcaactgctatgacaggactgattgtgtagaaaaaaaagacagc
cctgaagtatatttttgttgctgtgagggcaatatgtgtaatgaaaagttttcttatttt
ccggagatggaagtcacacagcccacttcaaatccagttacacctaagccaccctattac
aacatcctgctctattccttggtgccacttatgttaattgcggggattgtcatttgtgca
ttttgggtgtacaggcatcacaagatggcctaccctcctgtacttgttccaactcaagac
ccaggaccacccccaccttctccattactaggtttgaaaccactgcagttattagaagtg
aaagcaaggggaagatttggttgtgtctggaaagcccagttgcttaacgaatatgtggct
gtcaaaatatttccaatacaggacaaacagtcatggcaaaatgaatacgaagtctacagt
ttgcctggaatgaagcatgagaacatattacagttcattggtgcagaaaaacgaggcacc
agtgttgatgtggatctttggctgatcacagcatttcatgaaaagggttcactatcagac
tttcttaaggctaatgtggtctcttggaatgaactgtgtcatattgcagaaaccatggct
agaggattggcatatttacatgaggatatacctggcctaaaagatggccacaaacctgcc
atatctcacagggacatcaaaagtaaaaatgtgctgttgaaaaacaacctgacagcttgc
attgctgactttgggttggccttaaaatttgaggctggcaagtctgcaggcgatacccat
ggacaggttggtacccggaggtacatggctccagaggtattagagggtgctataaacttc
caaagggatgcatttttgaggatagatatgtatgccatgggattagtcctatgggaactg
gcttctcgctgtactgctgcagatggacctgtagatgaatacatgttgccatttgaggag
gaaattggccagcatccatctcttgaagacatgcaggaagttgttgtgcataaaaaaaag
aggcctgttttaagagattattggcagaaacatgctggaatggcaatgctctgtgaaacc
attgaagaatgttgggatcacgacgcagaagccaggttatcagctggatgtgtaggtgaa
agaattacccagatgcagagactaacaaatattattaccacagaggacattgtaacagtg
gtcacaatggtgacaaatgttgactttcctcccaaagaatctagtctatga

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_4|492_aa
MQTEVLEKVQSAKGPDRVHILLQPLVTRSYREDRALSKSEAWTTVGESVTHLSELLKRTA
LHGESNSVLIIGPRGSGKTMNGKSLLTTGRFDRGAGDVAAEARGPGIPYALKTMQLINHA
LKELMEIEEVSENVLQVHLNGLLQINDKIALKEITRQLNLENVVGDKVFGSFAENLSFLL
EALKKGDRTSSCPVIFILDEFDLFAHHKNQTLLYNLFDISQSAQTPIAVIGLTCRLYVKI
FKEQLSLPAEFPDKVFAEKWNENVQYLSEDRSVQEVLQKHFNISKNLRSLHMLLMLALNR
VTASHPFMTAVDLMEASQLCSMDSKANIVHGLSVLEICLIIAMKHLNDIYEEEPFNFQMV
YNEFQKFVQRKAHSVYNFEKPVVMKAFEHLQQLELIKPMERTSGNSQREYQLMKLLLDNT
QIMNALQKYPNCPTDLSRCKPHSLHPRDEAHLIMVDKLFDVLLDSFCQYFIQLEWQSLKS
QETTGAGEDVEK

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_4|1479_bp
atgcagaccgaggtcctagaaaaagtccagtccgccaagggaccagatagggttcatatc
ctgctacagcccctggtaacaaggtcctatcgggaggacagggccctttccaagtctgaa
gcctggaccacagttggggaatctgtcacacacttaagtgagctgctgaaaagaactgct
ctccatggagagagtaactctgtccttattatcggaccccgaggatcaggaaaaactatg
aacgggaagtccctcctgactacgggaagatttgatcggggtgcaggagatgtagctgca
gaggccagaggacctggaattccctatgctttaaaaaccatgcagttaataaatcatgct
ttgaaagaactcatggaaatagaagaagtgagtgaaaatgtattacaagttcacttaaat
ggactgctgcagatcaatgacaaaatcgccctaaaggaaatcacaaggcagttaaatctg
gaaaatgtagttggagataaagtttttggaagctttgctgaaaacctttcatttcttctg
gaagctttaaaaaaaggtgaccgaactagcagttgcccagtgatcttcatattagatgaa
tttgatctttttgctcatcataaaaaccaaacacttctctataatctttttgacatttct
cagtctgcacagaccccaatagcagttattggtcttacatgtagattgtatgttaaaata
tttaaagaacagttatctctacctgcagagtttccagacaaggtttttgctgagaagtgg
aatgaaaatgttcagtatctctcagaagatagaagtgtgcaagaagtactacagaagcat
ttcaatatcagcaaaaacctgcggtcattacacatgctattgatgcttgctttaaatcga
gtaacagcatcgcacccatttatgactgccgtagatctaatggaagcaagccaactgtgt
agcatggactcgaaagcaaatattgtacatggtctatcagtcttggaaatctgtcttata
atagcaatgaaacatttaaatgacatctatgaggaagagccatttaattttcaaatggtc
tataatgagtttcagaagtttgttcaaaggaaagcacattccgtttataattttgaaaaa
cctgttgtcatgaaggcttttgaacacttgcagcaattagaattaataaagcccatggaa
agaacttcaggaaattcacagagagagtaccagctgatgaaactgcttttggataatact
caaattatgaatgctctgcagaaatatcccaactgtcctacagatttgtcaaggtgtaag
cctcactccttgcatcccagggatgaagcccacttgatcatggtggataagctttttgat
gtgctgctggattcgttttgccagtattttatccagttagaatggcaatcattaaaaagt
caggaaacaacaggtgctggagaggatgtggagaaatag

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_5|150_aa
MKEIVDTAKRVGSDVFQHMGLREIQELTDTTPEELTEDKVEIDASEPLPDKPMRKTTIIS
IDVENAFDKIQHPFMLKTLNKLRIDRTYLKIIRAIYDKPTANITLNGQKQEAFPLKTGTR
KGCPLSPLLFNIVLEVLARAIRQEKEIKGI

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_5|453_bp
atgaaagagattgtagacacggcaaaaagggtggggagtgatgtgtttcaacatatgggt
cttagagaaattcaagagctaacagataccacaccagaggaattaacagaagacaaggtg
gagatagatgcttctgaaccactgccagacaaaccaatgagaaaaaccacgattatctca
atagatgtagaaaacgccttcgacaaaattcaacaccccttcatgctaaaaactctcaat
aaactaagaatcgatagaacgtatctcaaaataataagagctatttatgacaaacccacg
gccaatatcacactgaatgggcaaaaacaggaagcattccctttgaaaaccggcacaaga
aaaggatgccctctctcaccactcctattcaacatagtattggaagttctggccagggca
atcaggcaagagaaagaaataaaaggtatttga

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_6|64_aa
MTSSVAGPRRSSKALPKAKLAPKKAMVTVCWSAAGLIHYSFLNPGETITSEKYAQQINEM
HLKL

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_6|195_bp
atgaccagctcagtggctggaccgcgaagaagctccaaagcacttcccaaagccaaactt
gcaccaaaaaaggccatggtaactgtttgctggtctgctgccggtctgatccactacagc
tttctgaatcctggagaaaccattacatctgagaagtatgctcagcaaatcaatgagatg
cacttaaaactgtga

>gi568815596r:147835513_148075958|GENSCAN_predicted_peptide_7|285_aa
XSVIEAEDPEVTFTEIPHHTLLLLPKQVIESRFCSVVSPPTPAELVFNDPSRARCQSAVK
AWSAVPENARKLSSRSPDEVPSSIFNFKHWQNSSTVMWMGTPPHTPLPTPTPPSPPPAPP
PPHPGAPLLPSLELWRMTQRVIKGGSGGGGGEGGGGELGKWLLLGVVRCLQPGPSSSSSS
SSSSSSSSNSSSSSNSSSKGSVLLRGFWFLSLLLSRPKRRRVEGGRRVGGDGGLLSRDRD
KSSVCILTRCITRVAHIGYCALHALPLNPPWYSGVALGKEQAGKS

>gi568815596r:147835513_148075958|GENSCAN_predicted_CDS_7|858_bp
ntttcagtaattgaagctgaggatccagaagtcaccttcacagaaattcctcatcacacc
ctcctgttactcccaaaacaagtcatagagtccagattctgtagtgtcgtcagccctcca
actcccgctgaactggtgtttaacgatccttcaagagcaagatgccaatctgctgtaaaa
gcatggtctgctgtacctgagaatgcgagaaaattaagttcaagatcacctgatgaagta
ccttcatcaattttcaacttcaagcattggcagaattcatcaactgttatgtggatgggc
accccacctcacacacccctccctactccaacccctccttcaccccccccagcaccaccc
cctcctcaccccggagctccactgctgccttccctggagctgtggaggatgacacaaagg
gtaataaagggggggagtggaggaggaggaggcgaaggaggaggaggagagctggggaag
tggctgctcctgggtgtagtgagatgtctccagccagggccaagcagcagcagtagcagc
agcagcagtagcagcagcagcagcaacagcagcagcagcagcaacagcagcagcaaaggg
tctgtgttgctaagaggcttttggtttctttctctcctcctctcacggccaaagaggagg
agggtggagggagggaggcgagttggaggggacggaggacttctatctagggacagagac
aagtcctctgtatgcattttgaccagatgcatcacacgcgttgctcatattggatattgc
gcccttcatgccttacctctcaaccctccctggtattctggagtggctttggggaaggag
caggcagggaagtcttga