GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:19:25 Sequence gi568815579r:47121805_47332602 : 210798 bp : 50.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3292 3476 185 2 2 86 39 95 0.414 2.11 1.02 PlyA + 3820 3825 6 1.05 2.00 Prom + 4485 4524 40 -2.86 2.01 Init + 9019 9224 206 0 2 34 49 344 0.956 21.42 2.02 Intr + 21690 21801 112 0 1 63 85 74 0.983 4.98 2.03 Intr + 28398 28571 174 2 0 52 87 184 0.996 14.84 2.04 Intr + 31094 31236 143 1 2 119 89 56 0.999 7.95 2.05 Intr + 33310 33409 100 1 1 88 95 97 0.997 10.51 2.06 Intr + 48014 48119 106 1 1 91 100 108 0.991 12.09 2.07 Intr + 49014 49129 116 1 2 101 83 17 0.890 2.67 2.08 Intr + 72266 72406 141 1 0 67 106 60 0.301 6.25 2.09 Intr + 72883 72928 46 0 1 46 76 23 0.506 -5.12 2.10 Intr + 75429 75573 145 1 1 96 77 65 0.866 5.54 2.11 Intr + 81867 81936 70 0 1 131 93 54 0.943 9.38 2.12 Term + 87355 87447 93 0 0 120 43 113 0.993 7.83 2.13 PlyA + 88366 88371 6 1.05 3.06 PlyA - 88711 88706 6 1.05 3.05 Term - 100114 99998 117 1 0 136 43 67 0.804 5.54 3.04 Intr - 104950 104760 191 2 2 91 100 290 0.992 29.80 3.03 Intr - 106642 106354 289 1 1 156 113 184 0.983 24.52 3.02 Intr - 118140 118007 134 1 2 27 52 67 0.045 -2.64 3.01 Init - 118996 118951 46 1 1 80 110 36 0.586 3.99 3.00 Prom - 126719 126680 40 -5.46 4.00 Prom + 128975 129014 40 -6.16 4.01 Init + 136370 136420 51 1 0 86 44 27 0.514 -0.74 4.02 Intr + 136755 136859 105 2 0 57 49 231 0.949 16.51 4.03 Intr + 138517 138618 102 0 0 91 97 157 0.999 17.27 4.04 Intr + 138784 139035 252 0 0 113 97 61 0.991 7.13 4.05 Intr + 142799 142882 84 1 0 99 60 195 0.999 17.82 4.06 Intr + 142969 143142 174 0 0 115 94 336 0.996 37.04 4.07 Intr + 144807 144988 182 2 2 47 76 329 0.999 26.27 4.08 Intr + 148603 148649 47 1 2 117 101 22 0.592 4.45 4.09 Intr + 148749 148884 136 1 1 88 75 81 0.998 6.43 4.10 Intr + 149278 149383 106 1 1 115 94 58 0.996 9.32 4.11 Intr + 149470 149845 376 0 1 74 76 399 0.998 31.89 4.12 Intr + 150265 150357 93 2 0 67 75 50 0.741 1.54 4.13 Intr + 151553 151602 50 0 2 129 69 81 0.058 8.70 4.14 Intr + 153434 153540 107 1 2 23 60 145 0.167 4.21 4.15 Intr + 153921 154072 152 0 2 114 77 -36 0.169 -2.29 4.16 Intr + 155997 156066 70 1 1 113 101 23 0.056 4.34 4.17 Intr + 178207 178306 100 1 1 21 119 -17 0.000 -5.19 4.18 Intr + 188266 188394 129 0 0 105 66 17 0.396 2.09 4.19 Term + 197977 199026 1050 0 0 118 48 1560 0.992 147.14 4.20 PlyA + 200243 200248 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 151553 151656 104 0 2 129 47 115 0.935 9.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:47121805_47332602|GENSCAN_predicted_peptide_1|61_aa XPGMALGRVIYTVSEEAFTEKFLQSVEEALPFGTNAWRTKEETIGQREAFPFCWNLLTTN P >gi568815579r:47121805_47332602|GENSCAN_predicted_CDS_1|186_bp nttcctgggatggctctagggagggttatttacactgtgagtgaggaggcattcacagag aagttcctccagagtgtggaagaagcactaccatttggcaccaacgcctggagaacaaag gaggagaccattggccaaagagaggcatttccattttgttggaacctgctgaccaccaac ccatag >gi568815579r:47121805_47332602|GENSCAN_predicted_peptide_2|483_aa MRRSTPGVLPAAVGGARVRRAVGLSGTGAEAGRAGAMVEKEEAGGGISEEEAAQYDRQIR LWGLEAQKRLRASRVLLVGLKGLGAEIAKNLILAGVKGLTMLDHEQVTPEDPGAQFLIRT GSVGRNRAEASLERAQNLNPMVDVKVDTEDIEKKPESFFTQFDAVCLTCCSRDVIVKVDQ ICHKNSIKFFTGDVFGYHGYTFANLGEHEFVEEKTKVAKVSQGVEDGPDTKRAKLDSSET TMVKKKVVFCPVKEALEVDWSSEKAKAALKRTTSDYFLLQEFFLCLKTILTLMKGGVVFP LAINELLISPSLLVVKRLLASVEPPVVASKTVSSSVVLPPIAFTDAETKIHSLAIGPLAP RSVWHKAGYLWTRCSGQLEGVVLLKFRTDKGRDPSSDTYEEDSELLLQIRNDVLDSLGIS PDLLPEDFVRYCFSEMAPVCAVVGGILAQEIVKALSQRDPPHNNFFFFDGMKGNGIVECL GPK >gi568815579r:47121805_47332602|GENSCAN_predicted_CDS_2|1452_bp atgcgcagaagcactccgggcgtgctgccggcggcggtaggtggcgcgcgggtccggcgg gcggttggcttgagcgggaccggagctgaggcaggaagagccggcgccatggtggagaag gaggaggctggcggcggcattagcgaggaggaggcggcacagtatgaccggcagatccgc ctgtggggactggaggcccagaaacggctgcgggcctctcgggtgcttcttgtcggcttg aaaggacttggggctgaaattgccaagaatctcatcttggcaggagtgaaaggactgacc atgctggatcacgaacaggtaactccagaagatcccggagctcagttcttgattcgtact gggtctgttggccgaaatagggctgaagcctctttggagcgagctcagaatctcaacccc atggtggatgtgaaggtggacactgaggatatagagaagaaaccagagtcatttttcact caattcgatgctgtgtgtctgacttgctgctccagggatgtcatagttaaagttgaccag atctgtcacaaaaatagcatcaagttctttacaggagatgtttttggctaccatggatac acatttgccaatctaggagagcatgagtttgtagaggagaaaactaaagttgccaaagtt agccaaggagtagaagatgggcccgacaccaagagagcaaaacttgattcttctgagaca acgatggtcaaaaagaaggtggtcttctgccctgttaaagaagccctggaggtggactgg agcagtgagaaagcaaaggctgctctgaagcgcacgacctccgactactttctccttcaa gaattctttctctgcctcaaaaccatcctcacgcttatgaaaggtggggttgtttttcct cttgctattaatgaactacttatcagtccatctttgctggtggtcaagagactgttagct tcagtggagccacctgttgtagcttctaaaactgtgtcctcctcagttgttctcccaccc attgccttcacagatgcagagaccaagattcactcactcgccattggacctttagcaccc aggtctgtctggcacaaagcaggttacctgtggacaaggtgtagtgggcagttagaaggt gttgtgctcttaaagttccgtacagataaaggaagagatcccagttctgatacatatgag gaagattctgagttgttgctccagatacgaaatgatgtgcttgactcactgggtattagt cctgacctgcttcctgaggactttgtcaggtactgcttctccgagatggccccagtgtgt gcggtggttggagggattttggcacaggaaattgtgaaggccctgtctcagcgggaccct cctcacaacaacttcttcttcttcgatggcatgaaggggaatgggattgtggagtgcctt ggccccaagtga >gi568815579r:47121805_47332602|GENSCAN_predicted_peptide_3|258_aa MRPGTLAQGGGSVPTAPRGPVPRPPRRGLPQIRADAAVANERARRLRELGRLCPPGIGHA APGSAMARARQEGSSPEPVEGLARDGPRPFPLGRLVPSAVSCGLCEPGLAAAPAAPTLLP AAYLCAPTAPPAVTAALGGSRWPGGPRSRPRGPRPDGPQPSLSLAEQHLESPVPSAPGAL AGGPTQAAPGVRGEEEQWAREIGAQLRRMADDLNAQYERRRQEEQQRHRPSPWRVLYNLI MGLLPLPRGHRAPEMEPN >gi568815579r:47121805_47332602|GENSCAN_predicted_CDS_3|777_bp atgaggcctgggaccctggcgcaaggtggggggtcggtgccaacagctccccggggcccg gttccccgccccccgcgccgagggctcccccagatccgggcggacgcggcggtggccaat gagcgcgctcgccgtctccgggagctgggccggctctgcccgcctggcattgggcacgcg gccccagggagcgccatggcccgcgcacgccaggagggcagctccccggagcccgtagag ggcctggcccgcgacggcccgcgccccttcccgctcggccgcctggtgccctcggcagtg tcctgcggcctctgcgagcccggcctggctgccgcccccgccgcccccaccctgctgccc gctgcctacctctgcgcccccaccgccccacccgccgtcaccgccgccctggggggttcc cgctggcctgggggtccccgcagccggccccgaggcccgcgcccggacggtcctcagccc tcgctctcgctggcggagcagcacctggagtcgcccgtgcccagcgccccgggggctctg gcgggcggtcccacccaggcggccccgggagtccgcggggaggaggaacagtgggcccgg gagatcggggcccagctgcggcggatggcggacgacctcaacgcacagtacgagcggcgg agacaagaggagcagcagcggcaccgcccctcaccctggagggtcctgtacaatctcatc atgggactcctgcccttacccaggggccacagagcccccgagatggagcccaattag >gi568815579r:47121805_47332602|GENSCAN_predicted_peptide_4|1121_aa MEVAVERRESEGYNSLGAATLDLKSKEEKDAELDKRIEALRRKNEALIRRYQEIEEDRKK AELEGVAVTAPRKGRSVEKENVAVESEKNLGPSRRSPGTPRPPGASKGGRTPPQQGGRAG MGRASRSWEGSPGEQPRGGGAGGRGRRGRGRGSPHLSGAGDTSISDRKSKEWEERRRQNI EKMNEEMEKIAEYERNQREGVLEPNPVRNFLDDPRRRSGPLEESERDRREESRRHGRNWG GPDFERVRCGLEHERQGRRAGLGSAGDMTLSMTGRERSEYLRWKQEREKIDQERLQRHRK PTGQWRREWDAEKTDGMFKDGPVPAHEPSHRYDDQAWARPPKPPTFGEFLSQHKAEASSR RRRKSSRPQAKAAPRAYSDHDDRWETKEGAASPAPETPQPTSPETSPKETPMQPPEIPAP AHRPPEDEGEENEGEEDEEWEDISEDEEEEEIEVEEGDEEEPAQDHQAPEAAPTGIPCSE QAHGVPFSPEEPLLEPQAPGTPSSPFSPPSGHQPVSDWGEEVELNSPRTTHLAGALSPGG GQSAPAFPESGPSLRGTQEAEEEGSEATPEAGPEGQETAEITDFQRLQLPLSRRRRYSDP DRRPSRQTPRETPEAAEGRRPGDASPPKFVWERPSWGVAFVCFVPGGLFPFLKELFFSWS REKQLSSSRAFEGFKSPSWDPLGIASQSKALATSKCLAHPPASSCLYPSPGAYCGLWALL RPPVVLEFLALTLQGWDVAMGIRLSRLWEQQQPRGAGAREDSPGEGTLELGLDSFNYTTP DYGHYDDKDTLDLNTPVDKTSNTLRVPDILALVIFAVVFLVGVLGNALVVWVTAFEAKRT INAIWFLNLAVADFLSCLALPILFTSIVQHHHWPFGGAACSILPSLILLNMYASILLLAT ISADRFLLVFKPIWCQNFRGAGLAWIACAVAWGLALLLTIPSFLYRVVREEYFPPKVLCG VDYSHDKRRERAVAIVRLVLGFLWPLLTLTICYTFILLRTWSRRATRSTKTLKVVVAVVA SFFIFWLPYQVTGIMMSFLEPSSPTFLLLKKLDSLCVSFAYINCCINPIIYVVAGQGFQG RLRKSLPSLLRNVLTEESVVRESKSFTRSTVDTMAQKTQAV >gi568815579r:47121805_47332602|GENSCAN_predicted_CDS_4|3366_bp atggaggtggcagttgaaaggagggagagtgagggctacaattctcttggggcagccaca ctcgatttgaaatcaaaggaggagaaggatgctgagttggacaagaggatcgaggctctt cggcggaagaatgaggccctcatccggcgctaccaggagattgaggaagaccgtaagaaa gctgaacttgagggagtcgcagtcacagctccccgaaagggccgctcagtggagaaggag aacgtggcagtggagtcggagaagaacctgggtccttcccggaggtctcctgggacccct cggcccccaggggccagcaaggggggccggactcctccacagcagggaggccgggccggc atgggccgagcatcgcgcagctgggagggcagccccggggagcagcctcgaggaggagga gctgggggccgtggccggaggggccggggccgaggttcacctcacctctctggagctgga gacacctcaatctctgaccgtaaatccaaggagtgggaggagcggcgcaggcagaacatt gagaagatgaatgaggagatggagaagatcgccgagtatgagcgcaaccagcgggaaggg gttcttgaacccaacccagtgcggaacttcctggacgacccccggcgacgcagcgggccc ctggaggagtctgagcgggaccgccgggaggagagccgccggcacggccgcaactggggg ggccccgacttcgagcgggtgcgctgtggccttgagcacgagcggcagggccgccgagct ggcctgggcagtgctggagacatgacgttgtccatgacgggccgggagcggtcggagtac ctgcgctggaagcaggagagggagaagatcgaccaggagcggctgcagaggcaccgcaag cccactggccagtggaggcgcgagtgggatgccgagaagaccgatgggatgttcaaggat ggcccagtccctgcccatgaaccatcccaccgctatgatgaccaggcctgggcccggccc ccgaagccccctacttttggggagttcctgtcccagcacaaagctgaggccagcagccgc agaaggagaaagagcagtcggccccaggccaaggcagcgcccagggcctacagtgaccat gatgaccgctgggagacaaaagaaggggcagcatccccagcccctgagactccacagcct acttcccccgagacttcccccaaggagacacccatgcagccacccgagatcccagctcct gcccaccggcctcctgaagacgagggggaagagaatgagggggaagaggatgaagaatgg gaggacataagtgaggatgaggaagaggaggagatcgaggtggaagaaggtgatgaggag gaaccagcccaagaccaccaagccccagaggctgcccccaccgggatcccctgcagtgag caggcccacggagtccccttcagtccggaggagcccctgctggagccccaggcccctggc acgccttccagccctttctcaccacccagcggccaccagcctgtgtccgattggggtgaa gaggtggagctgaattctccccggaccactcacctggctggcgccctctccccgggaggt ggccagtcagcccctgccttcccggagagtgggcccagcctccgaggaacccaggaagct gaagaggaagggtctgaggcaactccagaggcaggccccgaaggccaggagacggcggag atcaccgacttccagaggctccagctgccgctgagccgccgccgccgctacagcgaccct gaccgccgtccgagccgccagacacccagagagacgccagaggccgcggaggggcgaaga cccggagatgcctcccctcccaagtttgtgtgggaaagaccttcctggggtgtggccttt gtctgttttgtcccgggtgggctgttccccttccttaaggagctcttcttttcttggagt agggagaagcagcttagcagcagcagggcctttgaagggtttaaaagccccagctgggat cctctggggatagcaagtcagtccaaagccctggccaccagcaaatgcttggcgcatcct ccagcttcctcctgcctgtatccctcccccggtgcttattgcggcctctgggccttactc agaccccccgtggtgctagagtttctggctctcacactccagggctgggatgtggccatg ggaataagattgtcaagattgtgggagcaacagcaacctcgtggggctggggcccgagaa gattccccaggggaggggacccttgagttgggtctagactccttcaattataccacccct gattatgggcactatgatgacaaggataccctggacctcaacacccctgtggataaaact tctaacacgctgcgtgttccagacatcctggccttggtcatctttgcagtcgtcttcctg gtgggagtgctgggcaatgccctggtggtctgggtgacggcattcgaggccaagcggacc atcaatgccatctggttcctcaacttggcggtagccgacttcctctcctgcctggcgctg cccatcttgttcacgtccattgtacagcatcaccactggccctttggcggggccgcctgc agcatcctgccctccctcatcctgctcaacatgtacgccagcatcctgctcctggccacc atcagcgccgaccgctttctgctggtgtttaaacccatctggtgccagaacttccgaggg gctggcttggcctggatcgcctgtgccgtggcttggggtttagccctgctgctgaccata ccctccttcctgtaccgggtggtccgggaggagtactttccaccaaaggtgttgtgtggc gtggactacagccacgacaaacggcgggagcgagccgtggccatcgtccggctggtcctg ggcttcctgtggcctctactcacgctcacgatttgttacactttcatcctgctccggacg tggagccgcagggccacgcggtccaccaagacactcaaggtggtggtggcagtggtggcc agtttctttatcttctggttgccctaccaggtgacggggataatgatgtccttcctggag ccatcgtcacccaccttcctgctgctgaagaagctggactccctgtgtgtctcctttgcc tacatcaactgctgcatcaaccccatcatctacgtggtggccggccagggcttccagggc cgactgcggaaatccctccccagcctcctccggaacgtgttgactgaagagtccgtggtt agggagagcaagtcattcacgcgctccacagtggacactatggcccagaagacccaggca gtgtag